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INTRODUCTION 


This volume contains papers read at the Conference on Intuitionism and 
Proof Theory held at the State University of New York at Buffalo, New 
York, in August 1968. The conference was organized by us with the coop- 
eration of the Department of Mathematics at S.U.N.Y., and with financial 
support provided partly by the National Science Foundation (Social Sciences 
Division) and partly by S.U.N.Y. We take this opportunity to thank Miss 
Melstrads of the National Science Foundation, Mr. Blackstone, Director of 
Summer Sessions at Buffalo, and Mr. Snell, Dean of the Graduate School at 
Buffalo, for their financial support which made the conference possible; as 
well as innumerable students and faculty at Buffalo, who helped us in innu- 
merable ways. 

The organization of the volume is as follows: Papers VI-XVIII deal with 
Intuitionism and papers XVII-XXXI with Proof Theory; the two related 
papers XVII and XVIII of Prawitz and Scarpellini represent the intersec- 
tion of the two areas. The first five papers, including Yessenin-Volpin’s in- 
augural address, represent various tendencies in constructive mathematics 
which do not fit squarely into the dichotomy intuitionism-vs.-formalism. 

There are some discrepancies between the list of papers contained in 
this book and those actually read at the Conference. In particular, Takeuti 
gave a course of ten lectures on proof-theory, and Troelstra ten on intui- 
tionism; the Takeuti lectures will be published in an extended form by 
North-Holland, and Troelstra’s have already been published as vol. 95 in 
the Springer lecture-note series (“Principles of Intuitionism”’). A paper by 
Gilmore (‘‘Formalization of Naive Set Theory”) was withdrawn because 
the system presented turned out to be inconsistent, and one by Sabbagh 
(“Extensions Finies de Fragments du Calcul Propositionnel Intuitionniste”’) 
because its results were found at the last minute to be properly included in 
those of McKay, J. Symb. Logic 33 (1968) 258-264. Papers by Krasner 
(“Definitionism”) and De Jongh (“Elementary Remarks on Some Con- 
cepts and Methods of Intuitionism’’) were not received in time for inclusion 
in this volume. A paper by Hull (‘“Counterexamples in Intuitionistic Anal- 
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ysis”) had already been accepted for publication in the Zeitschrift fiir 
Mathematische Logik before the conference was held (Z. Math. Logik 15 
(1969) 241-246). Kleene’s paper (‘‘Formalized Recursive Functionals”) 
was not included because of its length; it has subsequently appeared as 
number 89 of the Memoirs of the American Mathematical Society (1969). 
The paper XX of Feferman contains only a part of the material presented 
at the conference. The rest is contained in a privately circulated manuscript 
entitled “Ordinals Associated with Theories for one Inductively Defined 
Sour 
Yessenin-Volpin was invited to give the opening address, but unfortu- 

nately could not come; he submitted the long paper with which this volume 
opens, an which stands here as an opening address. At the Conference 
itself an alternative opening address was read by Heyting (paper VI in 
this volume). Bernays too was unfortunately unable to come; his paper 
(paper XXV) was read by Howard. 

AKIKO KINO 

JOHN MYHILL 

RICHARD VESLEY 


SECTION A 


NEW DIRECTIONS 


THE ULTRA-INTUITIONISTIC CRITICISM 
AND THE ANTITRADITIONAL PROGRAM 
FOR FOUNDATIONS OF MATHEMATICS * 


A. S. YESSENIN-VOLPIN 


1. The aim of this program is to banish faith from the foundations of 
mathematics, faith being defined as any violation of the law of sufficient 
reason (for sentences). This law is defined as the identification (by definition) 
of truth with the result of a (present or feasible) proof, in spite of the 
traditional incompleteness theorem, which deals only with a very narrow 
kinds of proofs (which I call ‘formal proofs’). I define proof as any fair way 
of making a sentence incontestable. Of course this explication is related 
to ethics - the notion fair means ‘free from any coercion and fraud’ — 
and to the theory of disputes, indicating the cases in which a sentence is 
to be considered as incontestable. Of course the methods of traditional 
mathematical logic are not sufficient for this program: and I have to enlarge 
the domain of means explicitly studied in logic. I shall work in a domain 
wherein are to be found only special notions of proof satisfying the 
mentioned explication. In this domain I shall allow as a means of proof 
only the strict following of definitions and other rules or principles of 
using signs. I shall indicate several new logical theories of this kind dealing 
with modalities, tenses and voices of verbs, with identifications and distinc- 
tions, with rules of attention and neglecting, with general principles of 
semiotics, etc., and with the help of these theories I shall reconstruct 
arithmetic and prove the consistency of Zermelo-Fraenkel set theory (ZF) 
with any finite number of inaccessible cardinals. This program has been 
studied by me in full detail. I don’t insist I have eliminated all difficult 
questions — but [ have introduced in my proofs no essentially new hypothesis 
and even if I am using some kind of hypotheses, these are all of a linguistic 
nature, and are common to all thinkers. I need no hypotheses concerning 
infinity. 

But this program is too large for a short report, and here I shall confine 


* Opening address. 
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myself to the exposition of my ultra-intuitionist criticism and of some main 
features of the positive program. 


2. I begin with a criticism of the following traditional assumptions under- 
lying the body of modern mathematics. 

Tl. The uniqueness (up to isomorphism) of the natural number series; 

T2. The existence of the values of primitive recursive functions (prf) 
for every system of arguments for an arbitrary natural number series. 

If for each system x,,..., X, of arguments in a natural number series N 
the value of p(x,,.. ., ¥,) exists in N, I shall call N closed with respect to . 

T3. The principle of mathematical induction from zn to n’. 

T4. If the axioms of a formal system are true and the rules of inference 
conserve the truth then each theorem is true.* 

T5. The meaningfulness of the relations of identity and distinctness. 

I criticize these assumptions above all simply on the ground they are 
assumptions. Their incontestability has never been established and this is 
sufficient reason to doubt them. But this radical line of criticism does not 
help to establish the connections between these assumptions, and J shall 
examine them more closely as follows. 

Above all, proposition Tl uses a quantifier ranging over a domain 
‘including natural number series. It is too obscure to place at the beginning 
of mathematics. 

I accept the traditional intuitionistic criticism of Brouwer and go further. 
I ask: why has such entity as 10’? to belong to a natural number series? 
Nobody has counted up to it (10'? seconds constituting more than 20 000 
years) and every attempt to construct the 10'?-th member of sequence 
0, 0’, 0”, .. . requires just 10'* steps. But the expression ‘n steps’ presupposes 
that ” is a natural number i.e. a number of a natural number series. So this 


* The assumptions T1-T4 have been criticized by many people, e.g. Borel, Frechet, 
Mannoury, Rieger and van Dantzig doubted T1, or the ‘finiteness’ of very great numbers 
like 104°*° (van Dantzig). Lusin spoke about T1 asa ‘poignant problem’. The independence 
of T2 is a commonplace in the theory of primitive recursive functions; and doubts about 
the finiteness of 10'°*° lead immediately to a rejection of T2. H. Poincaré in his ‘Science 
et Méthode’ (1908) wrote that many people have rejected T3 because they found a vicious 
circle in its substantiation: he does not name these scientists, but the most plausible 
conjecture is that they spoke about the circle involving T3 and T4, as discussed in the 
text below. 

But this criticism has never been made in a systematic manner and J never heard of 
anybody who has criticized T5 or T8. 
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natural attempt to construct the number 101 in a natural number series 
involves a vicious circle. This vicious circle is no better than that involved 
in the impredicative definitions of set theory: and if we have proscribed 
these definitions we have to proscribe the belief in existence of a natural 
number 10!7, too. 

Let us consider the series F of feasible numbers, i.e. of those up to which 
it is possible to count. The number 0 is feasible and if n is feasible then 
n < 10’? and so 7’ also is feasible. And each feasible number can be ob- 
tained from 0 by adding ’; so F forms a natural number series. But 10! 
does not belong to F. 

Nevertheless the traditional natural number series containing 10!? is 
generally supposed to exist. And if we accept it there are at least two 
different natural number series. 

This is only the beginning of my criticism. I don’t really believe in the 
existence of a series containing 10'? — I shall prove the possibility of such 
a series — and I shall not need the notion of feasibility or any other empirical 
notion for my proof of the possibility of distinct natural number series. 
But for the present let me continue the consideration of F. Since 10 and 12 
are feasible numbers, 10!? is not such a number and the function a? is a 
prf, T2 is violated for F. 

This is not strange. The independence of T2 is many times indicated 
in the literature. The inductive definition of the natural numbers takes no 
account of the property T2 of a natural number series. 

The mathematicians use the following way of constructing natural number 
sequences: one chooses a number dy and defines a,+,, in terms of a,. Here 
I am interested only in the case a,,, > a, for all n. I call a natural number 
series N regular if it is closed with respect to a, for each such sequence a,, 
considered as a function of n. Of course, the assumption that every natural 
number series is regular is even stronger than T2: and so it is wrong. 

I doubt the possibility of a regular natural number series. This notion 
depends on arbitrary sequences of numbers: and so it is connected with 
problems like that of the continuum. It is natural to consider the notion of 
T-regularity, T being a class of number sequences, and one may expect 
to obtain a T-regular series for certain concrete T’s. 

Of course, with the help of T3 (for arbitrary induction properties) one 
can prove the regularity of every natural number series but this is an 
argument against the acceptance of T3, at least in its most general form. 

I call T4 the locality principle for proofs, — because it asserts that the 
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property of a tree-figure of being a proof depends only on the local proper- 
ties of this figure — more strictly, on the properties of its summits and nodes, 
the only ‘integral’ property consisting of the fact that the local properties 
are satisfied by each summit and node. A similar principle may be formulated 
for deductions from arbitrary premises in terms of ‘truth relative to the 
premisses’. 

It is clear that T4 depends on T3. Vice versa, T4 is used in obtaining 
P(m) for arbitrary m from P(O) and Va(P(n) > P(n')). This vicious circle 
forces me to reject both T3 and T4 in the deepest questions of foundations 
of mathematics. It is essentially on these grounds that I am not searching 
for any axiomatic theory in my program. 

This rejection of the axiomatic method does not mean the expulsion of 
the axioms and rules of inference as such. But I find that these rules are not 
sufficient to obtain a proof. We always need some supplementary ‘rules of 
guarantee’. I shall consider the traditional tree-figures of logical proofs and 
deductions too, but I shall not consider them as such and I shall call them 
the bodies of proof or deduction or more briefly the demonstroids and the 
deductoids. The rule of guarantee requires that a demonstroid or deductoid 
is to be accompanied by an establishment of its convincingness (which I call 
also the soul of the proof or deduction). This establishment must constitute 
a proof of the fact that for the given demonstroid the applications of rules 
of inference considered together form a way of leading only to truths (and 
similarly for deductoids and relative truths, i.e. truths relative to the 
premises of deductoids). This is a new proof, and occasionally it may consist 
of a new demonstroid with its own establishment of convincingness. But 
in order that this procedure stops at some time, one has to have some 
primitive kind of proofs, independent of demonstroids. I call these proofs 
the protodemonstrations. The use of definitions and certain logical rules 
from domains deeper than the predicate calculus (modality theory, etc.) 
are the only means available in protodemonstrations. 

The idea of establishment of convincingness is not completely strange 
even for traditional mathematical logic. Everyone understands that each 
tree-figure of proof requires a reference to T4. This reference (perhaps 
supplied by another reference to T3) is generally considered as an establish- 
ment of convincingness: and it is so uniform that it is always omitted. But 
I criticize this establishment of convincingness for its vicious circle (with 
T3 and T4). So I reject this traditional establishment of convincingness. 
I consider a special genetic theory dealing with these establishments. 
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3. Now I go further with my criticism of traditional assumptions. I shall 
consider T5. I recall the series F of feasible numbers. N denotes a natural 
number series with 10'?. 

It may seem quite natural to say that some numbers of N are feasible and 
that the number 10’? is not. I said this above myself. But this way of con- 
sidering the numbers of F leads to the situation that the finite set of numbers 
{0, 1,..., 10'7} of N contains an infinite part F. Perhaps it is not absurd 
but it leads to many difficulties. If somebody looks over this set of N- 
numbers he exhausts this infinite part, but I prefer to say an infinite process 
cannot be exhausted. This difficulty disappears if we distinguish between the 
F-numbers and the N-numbers equivalent to them. But sometimes it will 
be more convenient to identify these numbers belonging to different series. 
It is a very common thing; sometimes we identify objects that we distinguish 
in other cases. The rules of doing so must be established in a rigorous way. 

Already Heraclitus pointed out that the notion of identity is not com- 
pletely clear. But mathematicians prefer to proceed as if Heraclitus had not 
lived. I cannot continue in this way. The situation when an infinite process 
can be imbedded in a finite object is an ordinary one in investigations of 
distinct natural number series, and I shall need an apparatus for the explicit 
consideration of all identifications used in such cases. 

First of all I maintain that the relation of identity or distinctness has no 
other meaning than that two objects have been identified or distinguished. 
There are two important kinds of atomic actions: identifications and 
distinctions. These actions are commonly used according to certain rules and 
they can be forced by these rules. Actions do not need to possess any 
meaning but they may have an aim. The rules also can be accepted as a 
means to achieve some aim. A very common aim of the identification of 
two objects a and b is to prepare the acceptance of the sentence ‘a and b 
are identical’, and similarly with distinctions. I shall consider a general 
theory of collations, i.e. identifications and the distinctions. A more general 
theory of rules and aims is needed for it. 

Generally, one can identify or distinguish any two objects in two arbitrary 
occurrences of them. But collations play in our arguments a role similar 
to that of assumptions, and they must be made quite explicit or at least 
according to explicitly stated rules. 

You can identify Fermat’s last theorem with the sentence 2x2 = 4. 
Then, the last sentence being proved, you can say Fermat’s theorem is 
proved too. This is not awkward if you express it quite explicitly: if Fermat’s 
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last theorem is identified with the proved sentence 2x2 = 4, it is proved. 
If the identification were an ordinary one, it could be not mentioned; 
though in identification theory all rules of the ordinary identifications are 
to be mentioned. 

Although one can identify any two objects in arbitrary occurrences of 
them, there are cases of awkward identifications. E.g. such is every identifica- 
tion of a present object with an absent one. If you consider identifications 
used without explicit mention you have to proscribe these dangerous 
identifications and you have to require that no two objects are identified 
before they appear. 

Let us consider the series F and N once more. Each F-number will be 
identified with the equivalent N-number — but this presupposes the 
appearance of the F-number. So the identifications constitute an infinite 
process P. This process effectuates an imbedding of F in the finite part 
{Opole ets 10%33 of N: but this imbedding is infinite and you never can say 
it is over or that F is imbedded in {0, 1,..., 10'?}. 

I refer to as a Zenonian situation every case in which the events of an 
infinite process are to be identified with the parts of a finite object; and this 
finite object I call the field of a Zenonian situation. It is useful to establish a 
connection of this notion with the Zenonian paradox of the runner: A 
runner cannot get from the point A to the point B; for if he does then the 
infinite process of runnings of consecutive halves of this distance is over. 
But the events of this infinite discrete imaginative process are not necessarily 
the same as the parts of the real continuous process of running. (The 
continuous character of the process would be more evident if the running 
is replaced by the flight of a ball.) They are only identified with them: 
but the series of these identifications is an infinite process, and it can never 
be over as Zeno supposes in his argument. A similar criticism explains the 
paradox of Achilles and the tortoise. 

The identification of two composite objects f(a,,..., d,) and g(b;,..., Bn) 
is often produced as the result of a structural procedure of identification. 
The structures of these objects are supposed to be identified (which pre- 
supposes the identification of n in both occurrences). One identifies f with g, 
a; with 6; for alli = 1, ..., n, then one makes the establishment of exhaustion 
(of all f, g, a,,5,,...,4,,6,), and only then does one identify the objects 
f(a,,...,4,) and g(b,,..., b,). That is a structural identification. 

The described Zenonian situation with the field {0, 1,..., 10'7} is not 
impossible if one considers the series of identifications as an infinite one. 
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It is only an infinite imbedding of the infinite process F in the finite field. 
One may say this means the field is finite in some occurrences and infinite 
in other ones. I call it the Zenonian situation in the wider sense. 

But generally one considers a finite composite object in many occurrences 
of it and one identifies the object in these occurrences and one thinks these 
identifications may be considered as structural ones. Given the Zenonian 
situation one cannot carry out the structural identification of its field (with 
an infinite process imbedded in it) on the ground that the establishment of 
exhaustion becomes impossible. But if the identifications are not stated in 
an explicit manner it is impossible to notice the fact; and one talks about 
the field as if this impossible structural identification were carried out. That 
would be the Zenonian situation in the narrower sense. This situation is 
impossible — though without the analysis of identification one can easily 
overlook it. 


4. Before going further I introduce now some terminological conventions. 

The events of a natural number series N I call the N-numbers. If 
N,(0,,0;,...) and N,(0,,03,...) are two such series, then I call 0, and 0, 
equivalent and I denote it by 0, ~ 0, ; if m~n (where me N, andne N,) then 
I permit it to be said* that m’ ~ n’ also, and the relation m ~ n is always 
to be established in this way. If for each N,-number m there is an equivalent 
N,-number n, I say that N, is not longer than N, and I denote it by Ny < N3. 
If N, < Nz, and N, < N,, then I call the series N, and N, isomorphic. 
If N, < Nz and N, is not isomorphic to N,, I say that N, is shorter than 
N, — but intuitionistically this does not mean the existence of an N-number 
equivalent to no N,-number. If N, < N, and there is an N3-number q 
equivalent to no N,-number, I say that N, is shorter than g and I denote 
it by N, < q; in this case I say also that N, is explicitly shorter than N, 
(or that N, is explicitly longer than N,) and I denote it by N, < Np. 

If one tries to establish an isomorphism between two series N, and N, 
by considering the pairs <0,, 0,>, <0), 05>, ..., since the consideration of 
each pair ¢(m, n> presupposes the appearance of m and n one obtains only 


* Usually this is expressed by ‘if man then m’ x n’’. But such expressions overlook 
the fact that m’ ~ n’ can appear only after m~n has appeared. This negligence may 
lead to the platonistic representation of a realm of all equivalences of the kind m= n. 
Of course I have to reject this, and I prefer to speak here about the rules governing the 
appearance of the equivalences rather than speaking of the equivalences themselves 
without mentioning the rules. 
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the consideration of the third series of these pairs which is not longer than 
each of N, and N, and nothing more. 

I find it useful to consider here the analysis of an ordinary supposed 
justification of T3. Given P(0) and Va(P(n) > P(n')) one indicates the 
possibility of obtaining the deductions for P(0) > P(0’) and P(0’), then for 
P(0’) > P(0’’) and for P(0’) and so on. This justification of T3 uses: 

(a). An application of Carnap’s Rule 


ene (Ith eee, 


fe) Ym P(m) 


where m denotes an arbitrary value of the variable m. 

(b). The hypothesis of potential feasibility according to which, given a 
method of construction requiring an arbitrary finite number m of steps, 
each step being feasible on the supposition that all preceding steps are 
fulfilled, one considers as feasible the result of this construction. 

(c). The localization principle T4 for deductions. 

(d). A principle of parallelism according to which at the m-th step of the 
described deduction one obtains just the sentence P(m). 

(e). The features of the interpretation of signs V and > used in the 
justification of the logical steps leading from P(0) and Va(P(n) > P(n')) 
to each P(m). 

(f). The consideration of the truth of VmP(m) (relative to P(0) and 
Vn(P(n) > P(n'))) as denoting the deductibility of each P(m) (from the 
premisses just mentioned); this consideration is used in the justification 
of (a). 

(g). The indirect clause of the inductive definition of a natural number 
(leading to the conclusion that each m can be obtained from 0 by adding ‘). 

I should mention also the fact that the result of the construction used 
for each m is a deduction figure for P(m) (to which T4 is to be applied). 
I think one can obtain this fact from the definition of a deduction figure, 
the application of (b) and a kind of argument similar to (d). 


5. Now I describe briefly the structure of this antitraditional program: 

It contains a central nucleus consisting of two theories: the ontological 
theory and the genetic theory and a consistency proof for ZF (and some of 
its extensions) based on these theories. The ontological theory deals with the 
substantiation of the possibility of natural number series and other discrete 
processes required for this consistency-proof; and the genetic theory deals 
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with the notion of proof for sentences related to the events of distinct 
processes. The genetic theory deals especially with the elaboration of the 
notion of the establishment of convincingness for given processes called the 
studied processes. Also some fundamental theories called prototheories are 
needed in the central nucleus — these are e.g. collation theory, the modality 
and tense theories sketched below, and some others. Generally, only proto- 
demonstrations are available as means of proving in the prototheories and 
even in the ontological and in the genetic theories. Outside the central 
nucleus I shall have several extreme directions of the program concerned 
with the substantiations of some hypotheses occurring in the theories of 
the central nucleus. These hypotheses are implicitly contained also in the 
traditional theories, or in a general platform underlying them; at any rate 
they have nothing to do with impredicative definitions or other vicious 
circles occurring in the foundations of all traditional theories. The proto- 
demonstrations are the only available means of proof also in the extreme 
directions, and the gap between these and the prototheories may be a 
temporary one. If the foundations of ZF were the only task of the whole 
program I could say that the main problems of the extreme directions 
concerned with the substantiations of the mentioned hypotheses are essen- 
tially settled today. 


6. Now I continue the list of traditional assumptions. 

T6. The possibility of neglecting modalities and aims in foundations of 
mathematics. 

T7. The possibility of neglecting tenses, voices and moods of verbs. 

T8. The possibility of neglecting the rules of attention and neglecting. 

T9. The hypothesis of potential feasibility. 

T10. The division of theories into object theories and metatheories. 

Tl1. The postulates of the intuitionistic predicate calculus. 

The reasons for my criticism of T6 and T7 are connected with the previous 
analysis of traditional assumptions. 

Evidently when one applies T9 (see (b) above) one confuses the feasibility 
of construction with the supposition that the construction is fulfilled. 
E.g. for an arbitrary m one concludes from (b) and (d) above that the 
deduction of P(m) is feasible — but that does not suffice to obtain the deduc- 
tion of P(m’) with the help of P(m) > P(m’), for in order to obtain it one 
has first to obtain the deduction of P(m), and that is more than the assertion 
about its feasibility. Evidently there is a modal principle justifying this 
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transformation of the possible into the real in the foundation of mathematics 
—I shall call it the principle of modal fulfilment and analyse it more closely 
below. The exact formulation of this principle requires a theory of modalities. 

There is another thing which makes modal investigations necessary in 
the ultra-intuitionistic program. I mentioned the necessity of making 
explicit the rules of collations; and this requires a systematic treatment of 
rules and aims, which leads to a development of a theory of modalities. 

In virtue of the traditional intuitionistic criticism we have to consider 
the natural number series not as accomplished totalities but rather as 
infinite processes. But one cannot speak exactly about any process without 
using tenses in order to discern the past events from the future ones. For 
a natural number series, at each stage, its infiniteness belongs to the domain 
of the future and so one has to expose the rules of transforming the future 
tense into the present or the past in reasonings. The theory of tenses is 
closely connected with that of modalities, and in the course of the develop- 
ment of these theories voices and moods must also be taken in consideration. 

I introduce modal characteristics in the notion of natural number series. 
I call such series necessitary, real or eventual according to the modality 
‘necessary’, ‘factually’ or ‘possible’ with which the future appearance of 
n’ after n is asserted. Similar characteristics are to be used in connection 
with the notion of closedness of a series with respect to a function , with 
the relation N, < N, (according to the modality with which the existence 
of ge N, with N, < q is asserted) and so on. The notion of finiteness I 
connect with that of an end (but not with natural numbers) and one can 
distinguish the modal characteristics of finiteness etc. 

The theory of modalities will be closely connected with semiotics, i.e. 
the general theory of signs. 

In semiotics I consider the act of indication as central. An indication is 
an action by which its author shows to the addressee a point with the help 
of a sign. There is a denotational connection between the sign X and every 
point x of an indication and I denote this connection by Y¥ = x. 

A language is a method or system of indications, it is defined by means 
of constructing signs and by denotational connections. 

A character or tactic is a system of rules, i.e. of propositions expressing 
permissions and demands (see below). If a tactic is expressed in a language 
I call it a method. A way is a tactic or a method. 

The notion of connection is a primitive one. As with all primitive notions, 
its use is to be described by some way, i.e. by some method or at least by a 
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tactic. There are many tactics for following connections — they correspond 
in particular to the different kinds of connections. There are aim and 
denotational connections and the tactics for following them I call the tactics 
of attention. Every other tactic for following connections is subject in a way 
to a tactic of attention. 

To follow a tactic for following connections is to accept such sentences 
as ‘a is connected with 6’. Every sentence of this form is always connected 
to such a tactic x, and if some rule of x prevents the acceptance of the 
sentence ‘a is connected with b’ I say that ‘a is strange to b (relative to x)’. 

If a sign is made by means of a language I call it the name of its points 
in that language and the points I call the senses of that name. 

Each sign is used with at least one tactic of attention permitting one to 
notice that sign, and with a tactic of collations permitting one to identify 
it in its occurrences. I call a sign clear if there is an accepted tactic for 
following denotational connections with it. All points strange to the sign 
(relative to this tactic) are always to be neglected as its points. So a sign 
(in particular a name) is always a sign for its non-strange points only. 
A clear sign I call definite if it is necessary that it has just one point. The 
definiteness of a sign depends on a tactic of collations. 

A class is an indefinite name; its elements are the fixed values of its senses. 
(There are rules of using these explications. The idea is to identify the class 
defined by a name with that name; but that is not always convenient. But 
the use of the word ‘class’ is always to be connected with an indefinite name. 
Of course this use of the word ‘class’ has nothing to do with classical 
set-theoretical hypotheses on the existence of classes.) 

I call direct those occurrences of a sign in which the réle of the sign is 
confined to its playing a part as a sign for its points, and to the participation 
in following those tactics of attention and collations which make it a 
sign. (The réle of an object is defined as a class of aims and obstacles to 
which the object is attached by a tactic for following connections.) The rule 
of interchangeability of synonyms is suitable only for direct occurrences of 
signs. So the interchangeability of the terms ‘indefinite name’ and ‘class’ 
applies to direct occurrences of these terms only, and can be limited by 
some further rules useful for a text. These rules may be replaced by rules for 
distinguishing somewhere the senses of the both terms. All this question is a 
typical example of how closely the senses of different terms may be inter- 
connected. At the first sight even vicious circles must occur in these explica- 
tions but one can avoid them by distinguishing the senses of the same term 
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in different occurrences, in the hope that the system of explications gives a 
reduction for these terms attached to their occurrences. 

A collection is a finished class (i.e. a class whose elements are exhausted, 
which may be specified by means of collation theory). 

There are the following three fundamental semiotical principles: 

S1. Given a collection of indications {aj > a,, a, > a2,...,4,-1 > a,} 
in a language L the indication ay — a, also belongs to L. 

It is a rule for restricting languages. It enables us to follow the denotational 
connections. One uses this rule each time when the sense of word in a new 
occurrence is to be identified with its sense in a former one. 

S2. If the connections f > F, a, + A,,..., a, > A, belong to a language 
L, F being an operation applicable to the objects 4,,...,A,, then the 
notation f(a,,...,a,) > (the result of applying F to A,,...,A,) also 
belongs to L; some specifications of the rdéle of the order ‘a,,..., a, may 
be introduced, and ‘f(a,,...,a,)) may be replaced by another record 
which enables one to reconstruct it. (The principle of structural parallelism.) 

Parameters are indefinite indecomposable names. 

S3. When a text is accepted the result of fixing some collection of param- 
eters in it by their admissible values is also to be accepted; but this 
operation of fixation may require grammatical agreements in the text ob- 
tained by the fixation. 

I have to draw special attention to the agreements in the verbal tenses. 
The fixed values may in turn depend on parameters or even be parameters. 


7. Now I shall describe the main features of my modality theory. 

First of all, it cannot be an axiomatic theory on the grounds of the 
vicious circle with T3 and T4. Also it must precede other logical theories, 
in particular that of the applicability of the axiomatic method, i.e. the 
genetic theory. So all proofs in it must be protodemonstrations. 

I divide the modalities into three categories (necessary, real and possible) 
and four groups: deontic modalities connected with rules, e.g. permissions 
and demands, aim modalities connected with aims (‘possible’ means 
‘possible for an aim’ i.e. the aim might be achieved with this means, and 
‘necessary’ means ‘needed for some aim’), and the two kinds of alethic 
modalities — the organic modalities connected with methods or ways (I can 
= I am able, I must = I am forced by rules of a way) and the epistemic 
modalities (‘perhaps’ or ‘certainly’). The epistemic possibility of A may 
be interpreted as the organic possibility of the continuation of reasoning 
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after A is accepted. The epistemic necessity of A means the organic necessity 
of the acceptance of A forced by rules of a way. For each group there is 
the modality ‘real’ used, e.g. in the abstract consideration of an action in 
connection with rules in order to investigate the lawfulness of the action 
without any assumption about the alethic reality of the action. 

A situation is a class of (senses of) sentences. Generally, a situation S 
describes some conditions indicated in a way. The conditions may be 
described by a situation with different degrees of precision, the degree being 
sufficient or not according to the aims of the description. 

The modalities may apply to the situations and to the propositions 
— sentences or names of actions. The deontic, aim and organic modalities 
apply to the names of actions which may be propositions with one verb 
in the infinitive mood. The epistemic modalities apply to sentences (so the 
verb stands in the indicative mood). 

When a modal operation is applied to a sentence or to a name of action 
it is always to be accompanied by the name S of one or more situations. 
(An occasion possible in one situation may not be possible in another, etc.) 
That is one of the characteristic features of my modality theory. Sym- 
bolically the situation index S is to be applied to the sign of the modal 
operation (©, ! or Ol for organic modalities, denoting respectively organic 
possibility, reality and necessity; ©, 1, & for aim modalities; P, F and O 
for deontic modalities; and M, R and N for epistemic modalities). So 
M,A means ‘perhaps, in the situation S, A’, etc. If the assertion M; 4, 
5A etc. is accepted for a class Y of situations S, it is denoted by M;<A, 
©yA etc. For aim modalities the sign T denoting the aim is to be added: 
& § A means ‘in the situation S it is possible to achieve T by means of A’ etc. 
But if the situation S or the class Z is fixed in the context one can omit the 
index S or Y in formulating the modal proposition. So one omits the class 
2 of all situations obtainable in mathematics in the formulation of the 
hypothesis of potential feasibility. 

There are several degrees of modalities. Modal words were used in the 
basic semiotic explications before the notions of class and situation were 
introduced. In those contexts it was impossible to use systematically the 
situation sign. That was the first degree of modalities. After the notion of 
situation appears, the second degree is obtained. 

The theory of deontic modalities is essentially that of tactics or characters. 
These are systems of rules, a rule being a proposition of the form ‘In the 
situation S it is permitted to do A’ or ‘In the situation S it is required to do 
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A’; the proscription of A is defined as the proposition of the form ‘In the 
situation S it is required not to do A’. The atomic actions A for the founda- 
tions of mathematics are: 


(I). preferences; 

(II). acts of establishing or following connections, in particular acts of 
attention; 

(III). collations; 

(IV). indications; 

(V). acceptances of propositions, i.e. sentences, rules and optional 
propositions expressing aims or sometimes desires or wishes; (the 
acceptance of a method is considered as the acceptance of all its 
rules); 

(VI). perceptions; 

(VII). the acts inverse to (II) and (V) — acts of neglecting and acts of 
refusal of accepted propositions; 
(VIII). the abstention from the acts (I}-(VII). 
In some theories related to the ultra-intuitionistic program, e.g. the theory 
of disputes, new items appear: 
(IX). raising and answering questions (considered as actions similar to 
(V)); 
(X). addressing another person 
and 
(XI). including some text in the memory; 
the abstentions (VIII) are extended to these new items. 


The tactic expressed in a language is called a method (see above; now 
the notions of rule and tactics are specified). Tactics are means for aims 
(as well as materials which are usually considered as obtained by a tactic; 
in addition to them, the means are the rules occurring in tactics, and the 
actions performed for the achievement of the aims; in particular those 
fulfilled in the course of following the rules, etc.). By means of tactics new 
complicated deeds A may be introduced; and this leads to the extension of 
the class of rules. 

In a natural way the notion of following a tactic is defined. A discrete 
process* is the following of a method and I shall say that it is described 
by the method. 


* Generally I take the word ‘discrete’ always to mean ‘expressible in a language of words’. 
Procedures are processes which can be accomplished. 


Al ULTRA-INTUITIONISTIC CRITICISM, ANTITRADITIONAL PROGRAM 17 


To follow a tactic y generally presupposes another tactic fy by means of 
which the actions A indicated by the rules of y are fulfilled, the situations 
S are collated, and so on. The tactic w is called external or deeper with 
respect to x or to a process described by x. The actions made in following 
yw but not yx are called automatic with respect to y or a process described 
by x. By the way, it happens often that a tactic y or a process described by 
it uses many external tactics and the notion of automatic action applies 
to each of them. 

All this is very important for my program. The ultra-intuitionistic program 
introduces tactics of acts (I)-(VII): but these acts envelop the profoundest 
acts of our thinking underlying the ultra-intuitionistic reasonings too, Even 
the reading of a text uses the acts (I}-(II1) (e.g. one prefers to read from left 
to the right) and it would be completely impossible to investigate in the 
program the many tactics of attention and collations used in the beginning 
of the exposition of it. But this is superfluous. Most of these tactics turn out 
to be external to the tactics of the deeds we really need to investigate. So 
one can assume that many actions (I)-(III) are fulfilled automatically, and 
even apply such traditional expressions as ‘the same word as before’ etc. 
Only in certain cases especially indicated by the ultra-intuitionistic criticism 
shall I have to describe explicitly the rules of the tactics: and this criticism 
is ruthless enough, for those cases form a class sufficiently large for all 
important aims. Other cases may be treated with the traditional negligence 
to the explicitness of these actions which are considered as the automatic 
ones. However, the investigation of these external tactics are of great 
importance e.g. for cybernetics — but for the ultra-intuitionistic program 
it is not an urgent business. 

Tactics may be incomplete for two reasons: (a) there may be situations 
for which the rules permit to make each of two or more incompatible 
actions (I call these situations the Buridanian ones) and (b) there may be 
unforeseen situations of which the rules say nothing. For both cases there 
are general ways of completing the tactics (though the completed tactic 
does not need to become complete). In case (a), generally, it is done by 
tactics of preference. In case (b) one chooses one of two fundamental 
preferences: according to the principle of liberalism one prefers permissions 
to proscriptions and according to the principle of despotism one prefers 
proscriptions to permissions. The acceptance of the first principle leads to 
the liberal regime characterized by the rule ‘everything not proscribed by 
(the rules of) a tactic is permitted by the regime’ and the acceptance of the 
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second principle leads to the despotic regime expressed by the rule 
‘everything not permitted by the tactic is proscribed by the regime’. One 
applies the principle of liberalism in cases one seeks for means for the 
accepted aims. One applies the principle of despotism in cases one tries to 
achieve aims with the help of accepted means, or to verify the proposed 
means with the help of accepted criteria. 

Definitions are the rules of usage of names. Their usage is always governed 
by the despotic regime which leads to prohibiting the application of the 
term introduced by definition in any other sense. All inductive definitions, 
e.g. that of natural number, contain an ‘indirect clause’ which is the im- 
position of the despotic regime on the consideration of an arbitrary object as 
satisfying the definition. 

The theory of methods and ways is a generalization of the traditional 
theory of algorithms. There are at least four respects in which the former 
is more general than the latter: (a) the primitive operations of the normal 
algorithms (the substitution of words) are performed in a way which is not 
itself considered as an algorithm; (b) the theory of algorithms depends on 
natural numbers, which are to be obtained in some way; (c) the methods 
can be applied to objects of a completely general nature; (d) the ‘work’ of 
a method or way (i.e. the following of it) may be an undetermined process. 

Now I turn to the alethic modalities. A situation is called real if each 
element of it is accepted on the ground of a perception. So the void situation 
is real. The problem of perception is of course a very deep one (and it is 
connected with the consideration of active and passive voices) but in the 
domain of the foundations of mathematics we can restrict our attention 
to the perception of texts proposed in the exposition of exhaustions and of 
the atomic acts (I)}-(V) and (VI1)-(X1) without further analysis. Each real 
situation is also called possible. 

The principle of modal fulfilment (pmf). If the situation S is possible and 
the action or event A is possible in S then the situation S U {A} (obtained 
from S by adjoining A to it) is also possible. 

This principle holds for all four groups of the modality ‘possible’ (all 
occurrences of which in the formulation of the principle have to belong 
to the same group). Symbolically it may be stated in the form 


5 S A 
Sv {A} 
where *O’ denotes one of the modality signs ©, ©7, M or P. It may be 
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specified that A stands in the future tense above the bar and in the present- 
past tense below it, symbolically 


S AA 
O ——————— 
Su {VA} 


A denoting the future tense and V the present-past tense. If O is © or P 
then A in ©;A is the name of an action, and can be expressed by a proposi- 
tion with one verb in the infinitive mood, in the ©,A or above the bar 
it stands in the active voice (‘it is possible to do A’) but in S U {4} it stands 
in the passive voice. Below the bar A stands in the indicative mood. If O 
is M, then A in M,A is the name of an event, and the verb in A stands in 
the indicative mood both above and below the bar, and the voice is the 
same in both cases. So this principle contains also the rules for tenses, 
voices and moods. 

The justification of pmf may consist in the definition of a possible situation 
as such which can be obtained from a real one by a string of applications of 
pmf. A closer examination shows that in this way pmf is reduced to its 
simplest case for which it is rather a tautology. 

Perhaps a somewhat more explicit formulation of pmf is 


(or still more precisely with AA above the bar and VA below it etc.). Here 
one sees distinctly that A stands with the modality ‘possible’ attached to it 
above the bar and without this modality below it. That is the essence of 
the rule explaining its name of ‘fulfilment’, and it turns out that pmf is a 
formalization of the ‘utilization’ of possibilities. However the modality 
‘possible’ does not disappear below the bar, but it is attached to the situation 
Su {A} itself. Often one forgets to mention this occurrence of the modality 
and speaks of the situation S U {A} as of a real one. So one obtains faith 
in S u {A}: and that is the general way of accepting-on-faith in the domains 
of religion and philosophy, and in traditional intuitionist or constructivist 
mathematics too. Such are the cases when one believes in mathematical 
‘reality’ on the grounds that its objects can be obtained by a method. The 
ultra-intuitionist analysis reestablishes the correct use of modalities. 

A deep idea of this program consists of representing the traditional 
mathematical ‘real’ situations only as possible ones obtained by iterations 
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of pmf. The consideration of mathematical situations as possible is cer- 
tainly sufficient for the sake of consistency proofs. 

Now I continue the review of my theory of modalities. 

If a process D is described by a tactic y and for a situation S occurring 
in the course of D the rules of y demand an action or event A then for the 
continuation of D in S the fulfilment of A is necessary. 

That is the main principle of necessity. It is in virtue of this principle that 
the definitions are observed in the course of a development of a theory and 
it seems that this principle itself can be based essentially on the definition 
of the word ‘describe’. It is also in virtue of this principle that each move of 
a bishop in a chess-game necessarily is a move by a diagonal. This principle 
may seem quite tautological. But a closer examination shows that it depends 
on an elimination of a double negation. What really can be justified is that 
in the course of a process the rules of a tactic describing it cannot be violated. 
It is not the same as the ‘fact’ that the rules are necessarily observed. But 
generally I neglect the eliminations of double negations occurring in the 
deepest questions of that program because it is the matter of an extreme 
direction to deal with them. 

If an event e of a process D is over and done with in a situation S of D 
and d precedes e in D then it is necessary that d is over in S. (The principle 
of ordinal necessity, p.o.n.) The ‘being over and done with’ is a form of 
expressing the past tense. So the principle deals with that tense too. 

There are also such ‘trivial’ principles as ‘everything necessary is real’ 
and ‘everything real is possible’. These are stated only for the two groups of 
alethic modalities and for possible situations only. (The reason of the 
impossibility of a situation may be precisely a violation of some trivial 
principle in it.) For deontic or aim modalities the observance of these 
principles is the characteristic feature of just or purposeful activities. There 
are also some relations between organic and epistemic modalities (e.g. 
that organic necessity implies the epistemic necessity) — these too are stated 
for possible situations only. Various other connections — the distributivity 
of conjunction and the equivalence between the impossibility of A and the 
necessity of not-A — can be established (with some restrictions) for possible 
situations. 

The violation of the main principle of necessity is impossible in each 
possible situation. (The principle of negative evidence, p.n.e.) 

For aim modalities I accept, also for possible situations S only: 

If the aim Tis not achieved in S, in order to achieve it in S it is necessary 
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to apply (a) some sufficient means and (b) each of the necessary means 
(the inversion principle). 

Each system of actions is sufficient for the achievement of each of its 
necessary result. 

For the future tense I deal primarily with the tense ‘A will have been’ 
which I denote A,A or simply AA if the situation S to which this ‘will’ 
refers is clearly indicated in the context. I accept: 

(a). If A has been in S it will have been in every situation which is later 
than S. 

(b). If A is necessary in every situation of a class 2 of situations, A is 
necessary also in each future situation of Z. 

(c). In each possible situation an appearing event is always later than 
events already entering in the situation, and this order relation will neces- 
sarily be observed in every later situation. 

Also the rules occurring in pmf and pon are very important for the theory 
of tenses. The following rule is of a semiotical nature: 

(d). In order that a proposition be meaningful it is necessary that its 
predicate agree in tense with the objects to which it refers. This means e.g. 
that the phrase ‘tomorrow is August 12’ is meaningless if pronounced 
August 11; a meaningful phrase is on that date ‘tomorrow will be August 12’. 

(d) is very important for the analysis of deductoids exposed in the 
ordinary logico-mathematical language having no future tense. A sentence 
A(m, n), containing the names m, n of the events m, n of a process, can be 
meaningful only on the supposition that m and n have already occurred. 
(It could be otherwise if the predicate in A(m, n) stands in a future tense.) 
The deductoids may contain the rule (Ca) and so the sentences occurring 
in them constitute an infinite process. The identifications required for the 
substantiations of the applications of the rules of inference etc. may introduce 
a Zenonian situation (even in a narrower sense), and therefore it is not 
evident that the deductoid must possess an establishment of convincingness. 

So a sentence A(m,,...,m,) may occur in a deductoid only on the 
supposition that m,,..., m, have already occurred. That is the first genetic 
principle for ultra-intuitionistic proofs. The second genetic principle is that 
if n,,...,”, are events of the same process I shall say they have all ensued 
only if a later event n of the process has ensued (cf. p.o.n.). The third 
genetic principle is connected with the inversion principle: e.g. I cannot 
consider an occurrence of A in B (or even an occurrence of A depending 
on an occurrence of B) before this occurrence of B is considered as already 


22 A. S. YESSENIN-VOLPIN Al 


having occurred. These three genetical principles constitute a genetic 
constitution which is to be observed in the ontological and especially in the 
genetic theory. (In the ontological theory as well as in prototheories the 
sentences A(m, n) containing the future tense may occur; and then the first 
genetic principle may be violated. E.g. in the definition of a natural number 
series there is the location ‘n’ will be ensued’, to which a modality may be 
attached; the application of pmf to this location belongs to the ontological 
theory. Cases of such a nature with the future tense are only ones dealt 
with in the ontological theory.) 


8. Now I return to the consideration of the natural number series and the 
assumptions T8-T11. 

First of all, I have considered a series F without 10!? but it does not 
mean a number q has to be so great as 10’? in order it could be longer than 
some natural number series. If a series N contains a 20th event this does 
not mean that another series M has to contain its 20th event also. In the 
notion of series there is no proviso about this. 

So let us consider two necessitary series N, and N,, whose numbers 
I denote a5, @,,ee and 55, 0), tespectively. Wet yy, contain c,, e204 
let N, be shorter than 6,9, which means that there never shall be such 
entity as dz) in N,. (Later I shall prove the possibility that N, does not 
contain even a,.) Let N, N, be the series ageb,, a,. Dee. 0;, O,, Cee 
obtained from N, and N, by alternation. It really is a series because if 
a; has occurred then i < 20 and there is b; in N, and a;,, necessarily will 
have occurred; if b; has occurred in N,N, this means that a; has occurred 
in N, and so a;,, necessarily will have occurred. Now let us consider the 
task: to find the occurrence of 6,5) in N,N,. This task requires only 42 
steps each of them being alethically possible when the preceding steps are 
accomplished. Nevertheless the fulfilment of the task is impossible because 
if by5 is found in N,N, then aq also is found in it, and so dy, is in N,, 
which contradicts the condition. 

I say there is an obstacle to the fulfilment of this task, and I call this 
obstacle the catching of this task on N,. Of course catchings may arise 
also from more complicated alternation, e.g. the task of finding bj, 999 in 
the series dy, bp, @,, by, 65, dz, 53, 04, Os, Gg, Dg, 67; 0g, Oo, 47,-- (ine 
numbers of b’s after a;,, is one greater than after a;) also catches on N,. 
Generally catching is (roughly speaking) an obstacle to a task consisting 
in the fact that its fulfilment requires the exhaustion of an infinite process. 
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In the case of a Zenonian situation (in the narrower sense) with the 
field {0, 1,..., 10'7} of N-numbers the task of the structural identification 
of that field in its two occurrences catches on the series F of feasible numbers. 
So every Zenonian situation in the narrower sense is impossible on the 
grounds of a catching. Conversely, to every catching it is possible to associate 
a Zenonian situation (e.g. obtained by the identification of 5s in N,N, 
and in N3, the first one’s forming the infinite series; this situation is one 
in the narrower sense if one requires the structural identification of the 
segment {by, ..., bz} of Nz inits occurrences before and after this introduc- 
tion of infinity into it). 

The introduction of the notion of obstacle is a most important feature of 
the ultra-intuitionistic program. Of course the constructivists were also 
aware of some ‘trivial’ obstacles for their constructions, such as the absence 
of necessary means or a vicious circle in a task. But these obstacles always 
have been the barriers for a single step in the fulfilment of the task. The 
catching is a new kind of obstacle arising from the possibility of a finite 
sequence being longer than an infinite series. Each ‘finite’ task whose fulfil- 
ment is destroyed by a catching I[ shall call the field of this catching. 

The possibility of a catching frequently appears in the genetic theory, 
whose principal aim is to obtain a list of conditions sufficient for a deductoid 
to possess a semiotic substantiation undamaged by catchings. In this theory 
catchings are represented by sequences whose members are just the occur- 
rences like those of 5; in N,N ; and these sequences are represented by 
functions @, whose values are those occurrences and whose arguments are 
the numbers of a short natural number series on which the occurrences 
depend. E.g. b; = (a;) where a; is in N, and 5; is in N, Nj. I call these 
functions metafunctions (because the occurrences of the members of the 
sequences are traditionally considered as metatheoretical objects). Each 
catching in the genetic theory may be represented by a metafunction given 
for an infinite process (containing the arguments of the metafunction) and 
all of whose values must belong to the field of the catching and be distinct 
— these are called unbounded metafunctions. 


9. Now I pass to my criticism of T8. I wish to introduce a new branch 
of logic — relevancy theory — into the foundations of mathematics. It is a 
theory about how to take something into attention or to neglect it. 

These problems appear at each step in our thinking but the traditional 
logic worries too little about them. Everyone knows the syllogism: all men 
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are mortal, Socrates is a man, therefore Socrates is mortal. But another 
argument is no worse: All men can die only once, Socrates has died, therefore 
now Socrates cannot die and Socrates is not mortal. (I suppose that the word 
‘mortal’ refers only to beings who can die.) That is a paradox. Its explanation 
I see in the fact that the word ‘man’ had different senses in this text, one time 
denoting all men — living and dead — and one time denoting only living 
men. In order to eliminate all paradoxes of this sort one has to specify the 
way of following the denotational connections. Every way of following 
connections I call here a tactic of attention (but I mean mainly following 
the aim or denotational connections). 

Perhaps one feels the problem of this tactic is strange to the foundations 
of mathematics, but even if it is so this fact belongs to relevancy theory and 
one has to prove it, which already requires some acquaintance with this 
theory. That is already a criticism of T8. 

The main réle of relevancy theory is attached to the fact that one always. 
may neglect strange objects, i.e. objects strange to the aims of considerations. 
With some trivial provisos I insist upon the inverse principle: on/y strange 
objects may be neglected, which I call the main principle of relevantism. 
This means: first prove the strangeness of an object and only then neglect it. ~ 
This presupposes that the way of establishing or following connections, i.e. 
the tactic of attention is sufficiently well known. 

In traditional mathematical logic one has never argued about tactics of 
attention and rules of connection and neglection. One could think that the 
rigid system of definitions makes that superfluous. But the rigidity of this 
system depends on the notion of natural numbers and so it cannot be more 
rigid than this notion is. The rejection of T1 is already the rejection of this 
rigidity: and so the problem of introducing the tactics of attention into 
consideration becomes a very urgent one. 

But the means of traditional theories are not sufficient in order to fix 
in a unique way the supposed tactic of attention. The existence of con- 
sistent but w-inconsistent systems may be interpreted as revealing this fact 
even by traditional means. 

I shall apply the relevancy theory to the study of metafunctions. Strange 
objects are neglected and therefore instead of proving the impossibility of 
a metafunction p destroying a construction C I can prove that each such 
metafunction p must be strange. This I prove chiefly with the help of the 
following principle of strangeness which can be established in the relevancy 
theory: 
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If an object © is indispensable for any consideration of p but one considers 
C without taking € into attention then p is strange to C. 

The importance of this principle is due to the fact that when an ultra- 
intuitionistic theory T considers two natural number series F and N, 
F < N, one always can intentionally introduce the function p(x) on F 
such that p(m) is always the N-number equivalent to m and if F< qeN 
then this p enables one to introduce the Zenonian situation with the field 
qg = {0,1,...,¢} where all of 0,1,...,¢ are N-numbers. As explained 
before one can further consider a catching destroying an appearance Oc, 
of qg. But this does not mean one has to worry about it because it is possible 
that Oc, has nothing to do with T. q is one thing which may be really 
necessary for the study of N but Oc, may be strange to the theory T or to a 
part T’ of it in which this question arises. If p creates an obstacle for Oc, 
but not for the consideration of 9, then one may neglect this obstacle as a 
strange one provided one is interested only in @ but not in Oc,. One can 
neglect p in such cases even if the obstacle created by p impedes a con- 
struction of 9 considered in connection with Oc,. Let € be the equivalence 
relation between F- and N-numbers. Although © is a typical ultra- 
intuitionistic object it may happen that in T or a part T’ of it there is no 
need for the consideration of it: and then one can suppose that T or at least 
T’ is really being considered, without taking © into attention. Let g be 
considered in T or T’ respectively. Then in T (or T’ respectively) p is strange 
to g because € is indispensable for any consideration of p. 

Of course there may be cases of much more complicated metafunctions 
than this p. Typical objects © are e.g. fixations of parameters. Usually any 
ultra-intuitionistic consideration T’ depends on some chosen natural 
number series N;,..., N, which may be considered as the fixed values of 
parameters N,,..., N, for the series. Then T’ may be considered as the 
result of assigning these values to these parameters in a more general con- 
sideration T. Let Nj’,..., N;,’ be another system of values for N,,..., Ny 
not mentioned in T’. Then if p is a metafunction whose definition uses 

4>--+N;/ or Nj'-numbers qj) such that N; <q; (j= 1,...,k) then p 
is strange to T’. The nature of © may be completely arbitrary, e.g. © may be 
using of a tactic of attention x in a situation where there is no given tactic of 
collations for objects considered with the help of x. With the help of such 
a © one can prove the strangeness of a metafunction p defined in a manner 
independent of any tactic of collation; and with the help of a suitable © 
one can do the same, if the tactic of collation of p essentially differs from that 
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used for the elements of the supposed field of the catching represented by p. 
In such a manner I prove that a sufficiently symmetrical object cannot be a 
field of catching — this idea is well known in traditional mathematical logic 
(cf. e.g. Mostowski’s proof of the independence of the axiom of choice). 

There are some principles of preference for the tactics of attention in 
connection with obstacles. When one seeks means for the accepted aims 
one can prefer the attention to a means to the attention to an obstacle for 
the application of the means. But when one has chosen a means in order to 
achieve an aim, e.g. when one carries out a construction, one has before all 
to take into attention every obstacle for the aim. Nevertheless even here the 
word ‘obstacle’ cannot refer to any strange obstacle. 

The choice of a tactic of attention may depend (and usually does) on the 
accepted aims. But for the most part this choice is made implicitly and 
automatically with respect to the considered activity. Even in my own 
program, the cases in which I explicitly consider the rules of such tactics 
are comparatively rare (but important). 

A natural number series N is defined as a discrete process having an 
initial event 0 and a unary leading operation n’ with an evident (external) 
tactic of collations for its events and a despotic regime imposed on its events. 
This regime belongs to the tactic of attention which one has to accept in 
order to follow the notion of an N-number but it does not define this tactic 
uniquely. In order to see this, I shall consider the construction of a very 
short natural number series. 

Generally for constructing a natural number series N one can take an 
accomplished event — say, the indication of a void place — as its zero and 
take the indication of n as the leading operation n’. If one wishes to receive 
in this way a necessitary series N, one has to accept the rule requiring the 
indication of 7 in the situation that 7 has occurred and is distinguished from 
all preceding N-numbers. The necessitary series is also a real one. If one 
wishes to achieve an eventual series, one can accept two permissions (a) and 
(b) in place of this requirement: (a) it is permitted to indicate n; (b) it is 
permitted not to do so. In all cases the events, as they appear, are required 
to be distinguished from all preceding ones; and the fulfilment of this 
requirement is to be given by a single action with a parameter m for the 
preceding numbers. Then the sentence m # n may be accepted for each 
value n of the N-numbers and the m’s preceding that n, and by the fixing 
of m with m preceding n one can obtain the sentence m # n as an accepted 
one (see $3 above). But N is not fixed till the tactic of attention to its 
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numbers is specified and the notion of N-number remains not clear. Now 
suppose (by S3) this tactic is fixed in some manner. Then this notion becomes 
clear though depending on the undefinite value of this fixation. Let this 
tactic of attention be denoted by x. 

Now I introduce a new tactic t of attention to the N-numbers. At each 
stage n of N (ie. after the appearance of n before the appearance of n’) 
Tt permits one to take into attention only n' and the N-numbers < n. (The 
word ‘only’ is an abbreviation for the introduction of the despotic regime.) 
I call t the short-sighted tactic of attention. The series N considered with 
this tactic t I denote by N,. 

At each stage n of N, only the N,-numbers < n’ are considered as N,- 
numbers and there is no such entity as n’’ amongst all possible N-numbers. 
Evidently N, is not closed with respect to the function n”. If n = 0 then 
(at the stage 0) N, is to be considered as a series shorter than 2 (the number 
2 being taken from another series, perhaps from NV). As a matter of fact, 
after the sentence So asserting the appearance of 0’ in N, is accepted, then 
(at stage 0’) the name ‘N-number’ will denote just the entities 0, 0’ and 0” 
but not 0’, and so on. At each stage of every reasoning one has to consider 
which are the accepted sentences of the form S;. Here it is useful that the 
word ‘accepted’ stands in its passive form. 

That is the proof of the possibility of very short natural number series, 
and this proof is independent of all empirical considerations like that of 
unfeasibility of the number 10'*. Of course it is as yet too vague, because 
still I have not expounded the ultra-intuitionistic reconstruction of logic; 
and without this one does not know how to deal with N,. I criticize T4 but 
at present I have not secured even the simple applicability of the postulates 
of intuitionistic logic. So I have to reconstruct the substantiations of these 
postulates (see T11) in order to see that the substantiation is independent of 
the chosen tactic of attention, or at least that the short-sighted tactic is a 
suitable one for the acceptance of the postulates. This follows essentially 
from the fact that a tactic of attention has always underlain the consideration 
of the traditional intuitionistic postulates, a tactic deeper than its substantia- 
tion. Its properties have never been studied: and this may be explained by 
the fact they are almost inessential. The only important thing was that one 
can follow a chosen tactic and consider it as a fixed one. But the short- 
sighted tactic for N, contains even less requirements of action than the 
original tactic y for N and so one can follow it. There can be no obstacle to 
combining two tactics y and t so as to follow the first in considerations of 


28 A. S. YESSENIN-VOLIPN Al 


N-numbers and the second in considerations of N,-numbers: and with this 
combined tactic of attention one can consider the pair of these series. 
Besides this one has to remark that in purely logical reasonings about the 
given objects and processes nothing forces us to accept such a sentence as 
So. This fact belongs to the contemplations of voices, and leads immediately 
to the possibility of consideration of several natural number series at once. 
Of course, from the series NV, without 2 one can obtain a series with the 
arbitrary given number & from another series K but without 4 +2: it suffices 
to add N, to the accomplished beginning {0,..., k} of K. 

It is useful to remark that the rules of tenses prevent the acceptance of 
the appearance of 0’ in N. I can assume that N and JN, are necessitary. 
According to the definition of aseriesO has occurred (= is) in N,, and 0’ 
will have been, but there is no rule enabling to obtain from that the sentence 
that 0’ has been (= is). After 0’ will have been in N, 0’ will have been also, 
but a future relative to another future is not yet a future. So one cannot 
obtain a sentence about the appearance in N, of its future event 0”. 


10. Hitherto I spoke only about destructive critical ideas insofar as the 
possibility of constructing a sufficiently long natural number series seems 
to be diminished by this criticism. But the theory of collations leads to a 
contrary result, it restores the possibility of considering great finished 
totalities of objects. The operation of duplication of a given finished totality 
P = {a,,...,4,,} is accomplished in the following way: given one 
occurrence Oc, of P, form another one Oc, simply by indicating Oc, once 
more and distinguishing P and a; (i = 1,...,n,) in their old occurrences 
(through Oc,) and the new ones (through Oc,); then it is sufficient to 
indicate the pair of indications on Oc, and on Oc, and obtain with the help 
of S1 the indications on 2n, objects 0a,,...,0a,,, lay,..., 1a,, (0a; 
denoting the a; in its occurrence through Oc, and 1a; the occurrence of a; 
through Oc;). So with the help of indications and distinctions one obtains 
the duplicated totality 2P = 0a;,...,0a,,, la,,..., 1a,,. It is then natural 
to identify Oa; with a; (i = 1,...,n,). Starting with P = {0} one obtains 
after 40 steps of duplication a totality with 2*° (> 10'*) members. S1 is 
used for indications of these members. 

This is still not a natural number series with 10'7. In order to go further 
I have to consider a hypothesis related to T9: 

The central ontological hypothesis (c.o.h.). Let E be an accomplished 
procedure (called the basis of c.o.h.). Let for each step e of E /, be a finite 


Al ULTRA-INTUITIONISTIC CRITICISM, ANTITRADITIONAL PROGRAM 29 


procedure feasible if the /,’s are accomplished for all steps d preceding to 
ein E. (I call /, the Joad of c.o.h.). Then a procedure E, is feasible which 
consists in the reproduction of E with /, following each step e (before the 
later steps f of E are accomplished). 

C.o.h. is a very strong hypothesis which practically implies all applications 
of T9. It is a hypothesis to the effect that the applications of pmf may be 
iterated; and this is the only kind of iteration of pmf allowed in my program. 

C.o.h. deals only with the feasibility, i.e. the organic possibility, of J, and 
E,. There is no counterpart of c.o.h. for epistemic possibility in my theory. 

C.o.h. asserts only the feasibility of E,. Feasibility is a kind of possibility 
— the possibility of a construction — and if one wishes to assert the presence 
of E,; one has to accompany the application of c.o.h. by that of pmf. 

The acceptance of c.o.h. leads to a third degree of modalities and the 
strength of c.o.h. consists in the extension of the old rules to it. But one can 
show that c.o.h. is not a sufficient means for obtaining an impossible situa- 
tion from possible ones and so one can justify the applications of c.o.h. 
in the consistency-proof for ZF. For the leading idea of this proof is, that 
if ZF were inconsistent then a violation of the main principle of necessity 
would hold in a possible situation obtained with the help of c.o.h. (This 
violation would consist e.g. in the fulfilling of an identification of 0 with 
0’ in the course of an activity prohibiting this identification.) 

There is another principal hypothesis in my program: A// (non-trivial) 
obstacles other than catchings are strange. This hypothesis on obstacles 
essentially is not new to the foundations of mathematics because in tradi- 
tional mathematical logic even catchings are considered as strange. With 
the help of c.o.h. I can essentially show a protodemonstration of this 
hypothesis. But the detailed justification of c.o.h. and this hypothesis 
belongs to some extreme directions of the program. 


11. In connection with T10 I say that already the introduction of tactics of 
attention and collations makes the structure of logical theories more com- 
plicated than is provided for by T10. Often it is inconvenient to consider 
the collations used in a text at the same level as the text, and one has to 
return from the collations to the collated signs. The consideration of aims 
introduces pragmatics as a part of the semiotics under investigation. I don’t 
exclude the importance of the two levels considered in T10 but I find it 
insufficient for my aims. I deal essentially with semantical and pragmatic 
considerations connected with metatheories. (Following Curry I prefer 
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the term ‘epitheory’ reserving the term ‘metatheory’ for a well-defined epi- 
theory.) It is hard to answer whether the rules of inference belong to the 
object theories or to the metatheories in traditional logic. This question 
becomes still more difficult if one analyzes these rules and studies the means 
of analysis in a third theory. Certainly the new means do not belong to the 
object theory but the metatheory then splits. A fortiori it splits when we 
introduce considerations of an essentially new kind dealing with the estab- 
lishment of convincingness. The structure of the new program is to be 
described in terms of methods, tactics and aims rather than in terms of the 
traditional division into object and metatheories.* 

As concerns T11, I say here only that for most of the intuitionistic postu- 
lates one can repeat a way of substantiation available in the traditional 
logic. There are several methods, and the substantiation is always connected 
with the interpretation chosen for the ordinary logical operations. I have 
to carry out such a substantiation in order to see that it is compatible with 
the requirements of my prototheories: relevancy theory, collation theory, and 
the theories of the modalities, tenses and voices. Here I note that the 
traditional postulate — A > (A > B) was reduced by A. Heyting to | 
(Av B) & A > B. In order to substantiate the latter I use the permission 
to identify with b ‘the member which is distinguished from a of the pair 
{a, b} of distinguished objects’. It is not difficult to see that this permission 
is compatible with the requirements of collation theory. 

In connection with Bernays’ axioms VxA(x) > A(t) and A(t) > 4x A(x) 
I note that they presuppose the denotational connection x — t, These 
connections I call simply ‘arrows’. For a single axiom this is merely a trivial 
remark: but if a great totality of such axioms enters in a deductoid the 
study of these arrows becomes a large part of the establishment of con- 
vincingness, 

Consider two of Bernays’ axioms Vx A(x) > A(t(y)) and VyB(y) > 
B(s(z)) entering in a deductoid. They are to be accompanied by the 
arrows x — t(y) and y — s(z) respectively. The first of these arrows shows 
that for each value of y t(y) is an admissible value of x, and the second 
shows the same thing for z, s(z) and y. Then if the variable y is to be identi- 


* The notion of ‘formalization’ is now to be enlarged; ‘to formalize’ the use of a notion 
means for me ‘to expose a method of using its name’. Perhaps in this deviation from the 
traditional understanding of formalization by means of a formal system lies the essence 
of my criticism of T10. (N.B. A method = a system of rules (expressed in a language). 
Formalization is to be understood as any establishment of a system of rules.) 
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fied in its occurrences in both axioms it is natural to suppose that t(s(z)) 
should be (for each value of z) an admissible value of x. It is denoted by 
the arrow x — t(s(z)) which is also to be inscribed in connection with the 
deductoid, I say this arrow refers to it. If a third Bernays’ axiom, say 
C(r(w)) > 4zC(z), accompanied by the arrow z—>r(w), occurs in the 
deductoid, and the variable z is to be identified here with the same variable 
in its former occurrences, then in the same way the arrow x — f(s(r(w))) is 
to be considered as referring to the deductoid; and this means that (for 
each value of w) ¢(s(r(w))) is an admissible value of x. This fact in its 
general form is justified by means of S1-S3. But it must not be assumed that 
the composition of functions (with admissible domains and ranges) always 
leads to a well-defined function. E.g. every natural number series must be 
closed relative to n’ but N, is not closed relative to the function n’’. So the 
semantical conditions expressed by arrows may be violated. 

One of the principal concerns of the establishment of convincingness is 
to show the correctness of all arrows referring to the deductoid. There is 
a strict system of rules governing the notion of an arrow referring to a 
deductoid. They are in agreement with the requirements of collation theory, 
e.g. if an arrow x — t refers to a deductoid, and ¢ in this occurrence is to be 
identified with s somewhere in the deductoid then the arrow x > s also has 
to refer to the deductoid. (This arrow x > s is called the identificational 
arrow.) Many complications arise from the fact that rule (Ca) (instead of 
T3) is allowed as a rule of inference. But the main difficulties resulting from 
the use of (Ca) are connected with catchings. The latter are studied as 
unbounded metafunctions, and the second principal concern of the establish- 
ment of convincingness is to show that every such metafunction is strange 
to the deductoid. There are several ways to do it, e.g. the method of sym- 
metry, analogous to that used by A. Mostowski in his proof of the indepen- 
dence of the axiom of choice. This method is applicable when one has to 
show that a symmetrical part of a deductoid cannot be the field of a catching. 
The underlying idea, that one cannot define a single-valued function taking 
infinitely many values in a sufficiently symmetrical field, is made more 
precise by means of the principle of strangeness from relevancy theory. 

There is a kind of identification which is dangerous in the sense that it 
can very easily create a Zenonian situation. Suppose that ¢ is a function 
symbol entering in a deductoid (gy denotes a function defined in a studied 
process and taking its values also in a studied process) and that there is an 
infinite class of identifications of the values of g with different parts of a 
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finite object E (e.g. a term or formula entering into the deductoid). Then E 
becomes the field of a Zenonian situation. I call such identifications dangerous; 
this danger is of great importance, especially in the case when g is defined 
on a short natural number series (which is shorter than £), but theoretically 
even a long series may be cofinal with a very short subsequence of it. From 
now on I suppose that all deductoids considered are free from dangerous 
identifications unless the contrary is allowed explicitly. 

There is a protodemonstration of the fact that a deductoid without 
arrows always possesses an establishment of its convincingness.* The same 
is easily proved for deductoids without the use of (Ca). But this assertion 
depends on the fact that all deductoids are considered in a form not making 
use of open formulas; the axioms of Bernays mentioned are to be replaced 
by their closures. 

In order to say more about the form of deductoids I add here that only 
closed formulas are admitted in them; this affects the list of axioms for 
quantifiers. So only closed formulas of the type Vx A(x) > A(t) are admitted 
as axioms and in the arrow x — ¢ the term f¢ is a constant. But for each 
application of (Ca) with the premises P(m) whose deductoids use an arrow 
x + g(m) the conclusion VmP(m) is accompanied by the arrow x > o(m) 
which is said to absorb the arrows x > y(m). Thus the closure of a formula 
VxA(x) > A(t(y, z)) is proved with the help of two applications of (Ca), 
and is accompanied by the absorbing arrow x > ¢(y, z). Each arrow x > f 
accompanying a premise of a rule of inference accompanies also its con- 
clusion; and if x occurs in the other premise the arrow accompanies the 
latter premise too, provided x is identified in both premises. Thus arrows 
are repeated and I call them in their new occurrences the induced arrows. 
Now the rules for obtaining the composition x — t(s(z)) from two arrows 
x — t(y) and y > s(z) (and for the analogous compositions involving terms 
t(y, uw) with two or more variables) are formulated in a syntactically natural 
manner. (If an arrow is obtained by several acts of forming compositions of 
x > t(y) and y > s(z) etc., then the different applications of these acts are 
connected to distinct acts of identification of the variable y in the formulas 
of the deductoid.) If x enters into a formula with the arrow x — ¢ through 
a term g(x), it is required that g(t) should be an admissible value of a 
variable. A special interpretational occurrence of ¢(t) is introduced in such 
* Besides the absence of dangerous identifications, this prototheorem presupposes that 


all applications of (Ca) are of a nature really encountered in my investigations; cf. the 
second paragraph on page 31 or the third on page 35. 
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cases. (The word ‘term’ is replaced by ‘termoid’ if it is not certain that for 
each system of values of its variables the term denotes an admissible value 
of a variable.) 

One can reduce even the first principal objective in regards to arrows to 
the problem of proving the impossibility of non-strange catchings; a fortiori 
it is true for the second objective introduced especially in connection with 
catchings. The latter being the only (non-trivial) obstacles, one can say 
that the establishment of convincingness is the verification of the absence of 
(non-strange) obstacles for the construction needed in connection with the 
substantiation of the deductoid. 

Recall now the well-known theorem of Bernays and Gédel concerning 
the expressibility in the ordinary arithmetic of every prf @. If one analyzes 
the occurrences of arrows in the proof of this theorem (represented in a 
form where T3 is excluded in favour of (Ca)), one notices that the correctness 
of the condition on arrows presupposes that the natural number series 1s 
closed relative to 9. 


12. The establishment of convincingness of a deductoid consists mainly in 
considering a class called the envelope of the deductoid. The deductoid itself 
is a finite object: the applications of (Ca) are represented in it by their tasks. 
(Two tasks of (Ca) are to be identified if for every m they lead to the identi- 
fied deductoids of the premise B(m).) If an application of (Ca) with the 
premises B(m) enters into a deductoid P then the deductoids for the premises 
B(m) are not presupposed to appear before the conclusion ¥mB(m) of that 
(Ca). This is connected with the fact that the truth of Ym B(m) is interpreted 
as the truth, i.e. the feasibility of a proof, for each B(m) (cf.(f) on p. 10), 
but not as its presence. I say that the deductoids for B(m) refer to P but not 
directly (whereas the task of (Ca) and the formula of P refer to P directly). 
The envelope of P, called eP, consists of all objects referring to P; these are 
the formulas of P and of deductoids of premises of (Ca) referring to P, 
the arrows belonging to formulas, the tasks of (Ca) and the applications 
of the other rules of inference, the interpretational occurrences of termoids 
and all collations required for these objects as well as the establishments of 
exhaustion refering directly to their structural procedures. For each process 
studied, I introduce the corresponding semantical process. The events of this 
are the names obtained for the events of the studied process, according to 
S2. Each such name appears in the semantical process after its sense has 
occurred in the process studied; it is called a semantical event. These events 
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refer to P and for each reference of a constant termoid ¢ to P the structural 
identification of ¢ in this occurrence with the synonym of ¢ occurring as a 
semantical event refers to P. (Structural identifications for two synonyms 
are defined in a natural manner with the help of identifications for more 
simple synonyms occurring as their parts.) These semantical identifications 
represent the interpretations of constant terms occurring in the envelope 
eP. Also for each m of an application of (Ca) a semantical identification of 
that m in the semantical process and in B(m) is required, in order to 
guarantee the coincidence of the studied process mentioned in connection 
with different applications of (Ca) related to the process. All objects of the 
envelope eP are considered as appearing in accordance with the requirements 
of genetical constitution. This entails that Zenonian situations destroying 
the feasibility of the tasks of the elements of eP may appear, and one of the 
requirements of the establishment of convincingness is to guarantee that 
this will not be the case. The constructions involved in the semiotical 
justifications of the intuitionistic postulates don’t require the appearance of 
any event except those which have to have occurred in order that the 
postulate be meaningful (in its reference to P). So these constructions are 
always feasible on the grounds of c.o.h., and they cannot introduce new 
Zenonian situations or catchings. On these grounds these constructions 
are not introduced in eP (but semantical identifications have been included 
in eP so that this introduction might be possible without any essential change 
in the feasibility of the other elements of eP). 

There are two principal kinds of arrows: 

(a,). those accompanying the Bernays axioms, the absorbing arrows 
accompanying the conclusions of (Ca) and the identificational arrows; 

(a2). the arrows obtained from the arrows (a,) by composition. 
For (az) as well as for (a,) there are induced occurrences; and compositions 
are to be made only from arrows belonging to the same occurrence of a 
formula. The rule for obtaining the identificational arrows is as follows: 
Let an arrow x > ¢, belong to a formula F and let Oc,,..., Oc, be the 
occurrences of terms f,,...,f, such that for each two Oc;, Oc;,,; (1 < i <n) 
either the identification of ¢; and t;,, in these occurrences refers to P or the 
occurrences Oc; and Oc;,, are parts of an equality ¢; = ¢;4, (or of a 
synonym Eq (t;, t;;,) of such an equality). Then the identificational arrow 
x — t, also belongs to the formula F in the same occurrence of the latter. 
The identificational arrows refer to P. But it is allowed to neglect them 
if x — t, already has an occurrence belonging to the same occurrence of F 
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or if there is a protodemonstration that the string Oc,,..., Oc, is strange 
to the construction of elements of eP. (The equality t; = t;,, may be 
proscribed in the construction of these strings if it is certain that it does not 
express an identification of ¢; and t;4,, e.g. when ¢; and f;, , are the events of 
different studied processes.) When it is not clear if an identificational arrow 
belongs to a formula, I prefer to count it as belonging to it. (Superfluous 
arrows can only damage the establishment of convincingness.) The rules for 
identifications related to parts of arrows are formulated in a natural manner. 

There is an important requirement called the requirement of termoidal 
completeness. When a task t(x) of a name ¢(x) of an event of a studied 
process refers directly to the task of a deductoid (e.g. through a task of 
(Ca)) for indefinitely many x’s, then it is required that there is a functional 
symbol ¢(x) representing this t in the (envelope of the) deductoid. It is 
required that for each of those x’s an identification of ¢(x) with t(x) refers 
to a deductoid or at least that there should be a string of such identifications 
uniting ¢t(x) with t(x). This requirement is in accordance with those of 
collation theory. It guarantees that one cannot prevent the appearance of 
the absorbing arrow x —> g(m) by replacing the g(m) by their synonyms 
not containing ‘¢’. 

The tasks of (Ca) may be, generally, of a completely arbitrary nature. 
But in practice I need for my aims only several simple cases of these tasks 
requiring no semiotic pathology. (The most difficult of them are connected 
with the representation of some variants of T3.) If all (Ca) are restricted 
to these simple cases there is a protodemonstration of the fact that the 
requirement of the termoidal completeness is fulfilled if the non-absorbed 
arrows (of type a,: cf. p. 34) are finite in number. The latter condition 
I call that of the finiteness of the arrows. 

For every element I of eP I denote by [I the finite set of objects preceding 
I in order of genetic constitution. (All these objects are to belong to eP.) 

Now I can say that the establishment of convincingness of a deductoid P 
consists in three requirements imposed on P and on deductoids P’ referring 
to P. 

I. every termoid referring to P shall be a term; 

II. the requirement of termoidal completeness; 

Ill. for every I] in eP’ each unbounded metafunction whose values belong 
to [7 is strange to the considered task of P’. 

In accordance with S3 the deductoids as well as their establishments of 
convincingness may depend on various parameters. These may be fixed in 


36 A. S. YESSENIN-VOLPIN Al 


an admissible manner through all occurrences of them; the establishment of 
convincingness remains an establishment thereafter. 

There are many reductions of I-III to sufficient conditions of a finitistic 
type. E.g. I observe for condition I that for the arrows x > ¢t(y) occurring 
in deductoids of formulas Va(P(n) > P(n’)), if a variable y occurs in t(y) 
it does not occur as the left part of an arrow, which saves the envelope from 
an accumulation of functional symbols in the right parts of arrows obtained 
by the composition of x > t(y) with the arrows y > s(z). Note also that 
from / arrows of the form x — t where ¢ is another variable y or y’ one 
cannot obtain an arrow x > r where r contains more than 2?" strokes. 
Of course if K; is a natural number series such that for each « in it there 
must be a K,-number equivalent to «+2?' this leads to some simple finitistic 
conditions sufficient for I. Requirement II is mostly replaced by the condition 
of finiteness of arrows. The fulfilment of III is guaranteed by a number of 
conditions dealing with certain symmetries of different parts of deductoids 
etc. 

I mentioned above that the fulfilment of the condition on the arrows 
can be reduced to the impossibility of some non-strange catchings. Now 
I can say that the field of those catchings consists of the numeroids, i.e. 
the termoids obtained by c.o.h. as a result of calculation of given termoids; 
the obstacle is an obstacle to the semantical interpretation of the numeroid 
and consists in the fact that the parts of the numeroid catch on a semantical 
process (or on a studied one). 

Now I shall describe briefly the semantical interpretation of the usual 
logical operators preferred by me in connection with T1l. A > B is inter- 
preted as the presence of a proof of B from A (this proof being of course of 
a somewhat more simple nature than that wherein the A > B occurs; 
e.g., this proof may depend on a more simple value of a parameter or on 
a more simple way of fixing it). —, A is short for A > f where f acts like 
0 = 0’. It is important to remark that this concerns only — in the usual 
contexts of mathematical logic, because the appearances of ‘not’ in the 
names of (VIII) on p. 16 cannot be reduced to implications. The quantifiers 
are introduced before the connectives and afterwards, e.g. conjunction is 
defined as a universal quantifier with a finite range. This provides me with 
connectives symmetrical from the beginning. (x) is called the universal 
quantifier: (x)A(x) means the presence of a method enabling one to accept 
A(x) for every value x of x that has occurred. (Ex) is called the particular 
quantifier and (Ex)A(x) means that there may occur (with an arbitrary 
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modal characteristic and tense) an x for which A(x) can be accepted. Here 
the modality of x in both quantifiers is arbitrary, but if it is ‘real’ these 
quantifiers become the usual Vx and 3x (used in the explication of the 
connectives). Some tactic of acceptance of sentences is here borne in mind. 

It is not difficult to construct a deductoid for two formulas A and — A 
and even to find an establishment of convincingness for each of them in 
such a manner that if one forms a further deductoid for A & — A one 
cannot establish its convincingness. More precisely, this impossibility holds 
only if the structural identification of both A’sin A & — A is to be fulfilled. 
There are cases when this identification leads to a violation of the condition 
on the arrows, and there are also other cases when this identification causes 
a Zenonian situation or catching (e.g. with the field A). These are the 
apparent contradictions or contradictoids and they don’t disturb the program. 
Some of them arise if one tries to prove with the help of T3 the isomorphism 
of two different natural number series: after replacing T3 by (Ca) and 
analyzing the arrows one finds a violation of the condition on the arrows. 
There are also contradictoids arising from considerations of the ultra- 
intuitionistic model for ZF (see below). E.g. one can in many ways indicate 
a set x and choose as A the sentence ‘x is finite’ (in some set-theoretical 
meaning of finiteness). Another contradictoid arises if one finds in this 
model a set y violating the axiom of choice and takes as A the sentence 
‘y is well-ordered’. (Besides the consistency of ZF I obtain ultra-intuition- 
istically Cohen’s result about the independence of the axiom of choice and 
the continuum-hypothesis; but not his result about the independence of the 
continuum-hypothesis from the axiom of choice in ZF.) It is worth noting 
that contradictoids disappear if one excludes the identification of A in the 
text of A & — A. Then the formula A & — A ceases to express a contra- 
diction. In connection with T11 I must say that conjunctions A & B may 
differ according to the possibilities that parts of A and B are or are not 
collated to one another. If one has a proof of A and another one of — A 
then the collations of different parts of A within the first A belong to the 
first proof, and those within — A to the second; but if one merely combines 
both proofs one makes by this action no identification of both A’s in the 
theorems A and — A. On similar grounds it is possible that there are two 
theorems A and A > B such that B is not a theorem. 

Sometimes I use the possibility of considering the texts of conjunctions 
A, &...&A, with no identifications of the parts of the different A,’s. 
This enables me to consider some important parts of deductoids as sym- 
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metrical. E.g. in the axioms A, &... & A, > A, (1 < j < 2) the identifica- 
tion of both A,’s destroys the symmetry of the conjunction. But if one 
analyzes the aims of collations one remarks that this identification is un- 
necessary for establishing the fact that implication is an axiom, provided 
that one knows that A; in the right-hand part of it is a member of the 
conjunction in the left-hand part of it. So one can exclude some of the 
customary identifications as useless for logical aims and reestablish the 
symmetry of many parts of deductoids. This symmetry is used in proving 
that these parts cannot be fields of catchings. It is only one of many similar 
devices of my program. E.g. I use the fact that in applications of (Ca) 
there is no need to collate the parts of different premises P(m) and P(n). 
Of course the exclusion of the customary identifications leads to a revision 
of T11. But as I have mentioned I can reestablish the substantiations of the 
traditional axioms of the intuitionistical predicate calculus. 

This belongs to a brief description of my genetic theory. This theory is 
essentially more complicated than the ontological theory. (That is natural 
because the notion of a proof must be more complicated than that of a 
natural number series.) 


13. Now I return to a brief discussion of the ontological theory. 

Let Ky be a natural number series containing a number k but not k+2 
(see above, p. 27). Then Ky does not contain 2*. 

For each Kg-number p I denote by f the totality of all Ko-numbers < p. 
(I call p the genesis of the Ky-number p and similarly I call the genesis 2 
of an event z of an arbitrary discrete process D the totality of all those events 
which have necessarily occurred provided z has.) For each Ky-number p 
I consider now the application of c.o.h. with the basis p and the load con- 
sisting, for 0 ep, in the formation of {0} (i.e. in the indication of a void 
place) and for each i+1€/ in the duplicating of the object obtained as 
the result of the ith load. So for each i+ 1 € p I obtain the totality of strings 
do... 4:41, where each a; (j < i+1)is 0 or 1. L apply the tactic of collations 
which enables one to identify 00... 0a;...a;,, of the (+1)st load with 
the a;...a;4, of the (i—j)th load. Now I apply c.o.h. once more with the 
basis p and with the (i+1)st load consisting in looking over the strings 
do... @;4, in their natural lexicographical order. This load is feasible if 
the ith such load is because the (i+1)st load consists in repeating the ith 
load two times (the first time in the form a,...a;,, or 0a,...a;4, of 
the strings, and the second time in the form la, ...a;,, of the strings with 
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the same ordering of a,...a;,,); here the step 10...0 is to follow the 
step 01...1, which causes a fold. Then I apply c.o.h. with the basis p 
and the load consisting for each i+ 1 € p in the repeating of the foregoing 
(i+ 1)st load without taking the fold into attention (the result of this load 
being the consideration of the strings a)...a@;4, aS appearing in lexico- 
graphical order). Now I consider the class K9 of all appearances of such 
strings dg... @;4, with i+1¢p where p is a Ky-number, the strings with 
different i’s being collated according to the tactic described. The operation 
x+1 or x’ is introduced for these appearances in the natural manner; and 
then the class Kj becomes a class of events occurring in the course of 
following a method which defines a natural number series. In the theory of 
collations I establish that this class Kg may be identified with this course 
itself, i.e. with a natural number series. (At least I show that the class of 
texts involving those wherein this identification is allowed is sufficiently 
large to include the texts I need in the central nucleus of my program.) 

So given a natural number series Ky with a Ky-number & such that 2* 
does not belong to Ky I obtain a new natural number series K$ containing 
numbers < 2! for each Ky-number i, and no other numbers. (Of course when 
I say this, I identify the equivalent Kj- and K%-numbers, which is not 
always allowed.) From the ontological standpoint the fact that KG is a 
natural number series means nothing but that the appearance of its events 
can be described by a method of the kind used in the definition of the notion 
of a natural number series and the fact of the possibility of K9 with the 
indicated property means nothing but the possibility of obtaining by means 
of c.o.h. the feasibility of each of the events of Kj. Further it is evident 
that if Ky < qe K, then Kj < 2%e K7. Also (b) K@ is closed with respect 
to the sum-operation a+b defined in a natural way. If g = 2* then 
q¢€ Kg < 2%, so (c) there is a natural number series KG closed with respect 
to a+b but explicitly not closed with respect to 2° and (d) one can construct 
many series Kj < K** <... each of them satisfying the property (c). 

Now I fix a natural number series L having property (c), e.g. L = K@, 
and I introduce the series My = Kj, M, = Mj,... Mis, = Mj... 
where i € L. For each L-number i the introduction of M; is made by means 
of c.o.h. with the basis 7 and the load consisting, for 0 €7, in obtaining the 
introduction of Kj, and for each e+1 €7, in obtaining K* from the series 
K obtained in the result of the eth load. (So by c.o.h. the names of these 
series are feasible and their events are only the points of indications allowed 
by S1.) Finally I obtain a natural number series M, whose events are those 
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belonging to some M,, ie L with the natural tactic of collations identifying 
equivalent numbers of different /,’s. Now it is evident that (e) M, is closed 
with respect to 2* and (f) if LZ, < L, then M,, < M,,. Also it is easy to 
show (g) L < M,. For the considered series L it is easy to introduce an 
operation g(L) such that L < @(L) and (ZL) is closed with respect to 
a+b (e.g. (Ko) = Ko, o(L) = L*', (Mi) = Misi, 9(Mz) = Mau). 
Let M, be M,,,). Then (e) and (f) hold with M, replaced by M, and (g) 
can be strengthened to (g*) L < M,. 
If me M;, then 
Pus 


2° 
2 , 


where 2 occurs g times belongs to M3, provided g EL. (This follows from 
i+géL.) I denote by 2 the ath superpower of 2 defined by the equations 
2) = 1, 2 = 2? If m = 1 then the result obtained shows that ae L 
entails 2 e M, (and a fortiori 2 ¢ M,). q ¢ Ko < 2! gives ge My < 22 
and further 


226M, <2%eM, <2” eM; etc. 


the number 
6 7K! 


ae | (e M,) 


with i 2’s being obtainable by means of c.o.h.; and L = Ko < 2! gives for 
each je L 
6 74 


Wek, 


where 2 occurs 2? times whence 
24 


Mie Oo 


with 27 2’s"And 27 < 2" pives*y, = 22"" UN So Mi ee 
but ge Ko= Mp gives 27? ec Wf'S M, S02?’ +e Mee ee 
similar way one can obtain an r such that re M, < 2.) Now the operation 
M, over L behaves itself towards 2 just as K* does towards 2° (with M, 
in the réle of Kg and 2%*1 in the réle of k); therefore I can repeat the 


+ Here it is supposed that L < L*. 
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construction used for K§ and 2% and obtain a series closed with respect to 
2 and (with the help of (g*)) even many such series N,, N2,.... 

One can go further and obtain series closed with respect to the operations 
2 defined by 29) = 2 and 2{9, = 1, 20%) = 201°), One can even 
replace here the equation 2{9, = 1 by 2§9, = 2$. For each i from a series 
L one can obtain another series N, < N, <... closed with respect to 2!”. 
One can even obtain series closed with respect to 2!” as a function of two 
variables i and m. According to a well-known result of R. Peter, for each 
prf g(n) one can find an i, such that g(m) < 2{”; so one can obtain a 
natural number series N closed with respect to any prf and even many such 
series Ny, N,... (The index i in N; may belong to any previously con- 
structed series.) 

Of course ultra-intuitionistically the notion of prf is to be relativized and 
generalized. Let prf be defined as in Kleene’s book ‘Introduction to meta- 
mathematics’. Then each prf can be obtained from some initial prf’s in a 
number of applications of two functionals: (a) the schema of composition, 
and (b) the schema of primitive recursion. This number is now a N-number, 
N being a parameter. (One of the initial functions also depends on such a 
parameter; but that is not very essential.) Further one can now consider 
prf’s defined on different series and taking their values in others. To the two 
mentioned functionals I propose to add still two functionals consisting 
merely in replacing an argument or value by its equivalent in another series: 
(c) continuation and (d) restriction. The réle of these functionals is to 
replace one series by another in the trivial manner, the resulting function 
being undefined if one cannot obtain its value in this way. This is the most 
essential ultra-intuitionistical generalization of the notion of prf; so one 
obtains the notion of prf defined by means of various natural number series. 
The notion of partial recursive function remains essentially the same, 
insofar as it depends on the calculations, where the latter are to be feasible 
on the ground ofc.o.h. It is easy to show that each partial recursive function 
x(x1,-.-x,) is a prf in the sense just introduced. Namely, let 
W(uy[y(%1,---%,,”) = 0]) be its traditional Post normal form — by a 
continuation of the series considered one gets its applicability (i.e. closure 
with respect to y and x). Now for the operator py[x(x,,.- +s Xa») = 9] 
I consider its representation in terms of the function t defined by a single 
equation 7(z’, 0, y) = y (see Kleene’s book § 57). Let N denote the series 
on which these functions are now considered and N <qeN. (For the 
series considered one always can find such an N and q.) Let 7 be defined 
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on N by the equations 7(z’,0, y) = y, 7(0, u, y) = T(v, uv’, y) = g. Then 
uy ly(X1,---s Xn») = 0] can be defined on N as the restriction to N of 
the prf 
Hy eC Gus - pes), | XG 20 ea) = y] 
y<q so s<y ssy 
andiso'y(<1. eis canal. 

Of course the result obtained before about the possibility of series N 
closed with respect to every prf is to be understood as referring to prf’s 
defined by means of N-numbers only. Even this notion is now relativized 
and one can find series N; < N, <... with this property. It may be that 
these prf’s on N, majorize Ackerman’s function on N,, and so on. 


14. But in order to prove the consistency of ZF with k inaccessible cardinals 
one needs only the k+2 series K < No <~ Ny <...~< N,, the N;’s being 
closed with respect to superpower and the possibility of every N,-number 
k’ of such No < Ni <... < Nj. For the pure system ZF it is sufficient to 
consider K < Ny with possibility of varying N, by choosing its values 
Ni < NO <...< N@ for each N-number m. By means of Ni, I construct 
a model D' for the system ZE ; equiconsistent with ZF without the axiom of 
infinity and I have to introduce the latter into D' *. In this model D! there 
is a set m, of 7 ‘urelemente’, and I choose K such that me K entails 
m+2?' eK. Then I introduce intentionally a Zenonian situation with the 
help of a 1-1 function defined on K and taking its values in m,, so that the 


set m, becomes infinite. After this I have to verify the axioms of ZF; (in- 


ad 
* The main differences between ZF; and ZF are the following: 


md 
(a). the axiom of extensionality is missing in ZF, and a new functional symbol g(x) 
with axioms x € q(x), and YWu(uex —ue y) D g(x) = G(X) is introduced in order to 
obtain the equiconsistency with ZF; 


~~ 
(b). the logic of ZF; is intuitionistic and all 3x in axioms are replaced by — Wx — 
(likewise for all disjunctions); 
(c). for the atomic formulas x € y and x = » the law 4 TA 3D A is accepted. 


~~ 
(c) enables one to prove the consistency of ZF; with classical logic. But (b) entails that for 
the set-theoretic functions (and even for those representing prf’s in ZF) instead of existence 
~ 
only the double negation of existence is provable in ZF,;. This explains the fact that the 
~~? 
closedness of No with respect to superdegree enables us to construct a model for ZF;. 
(d). The axiom of infinity is replaced by the introduction of a new alphabet «a, f,... 


of variables for the elements of m, and of the arithmetical signs 0 and ’ with Peano’s axioms 
Vaa’ 0 and VaV6(a’ = B’ >a = 8B). 
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cluding the axiom of infinity) for the resulting model. I construct a deductoid 


for each of them, and I show that if there were a contradiction C in ZF i 
containing < / formulas then by replacing in C the axioms by their deduc- 
toids I obtain a deductoid D for a contradiction (not only a contradictoid!*) 
and an establishment of convincingness for D. Here I use essentially the fact 
that the sets of the model D' are symmetrical, and so the main parts of the 
deductoids may be constructed as symmetrical also: in this way the most 
disturbing catchings are avoided. (More precisely, for k = 0, I construct 
deductoids in such a way that all objects taking part in it and depending 
on / are symmetrical. These cases are the most difficult of all.) 

Of course all this is but a very short sketch, in which only a very small 
part of the consistency proof for ZF is indicated. All technical parts of this 
proof are investigated by me in complete detail and exposed in many 
manuscripts. The most urgent of them is ready for publication. But the size 
of all these manuscripts exceeds that of Kleene’s book and that prevents 
me from publishing them quickly. In this most urgent text, the demonstroids 
are written down, as well as an exact description of the establishment of 
their convincingness and the chief ideas needed in carrying it out. After this 
text becomes available to my colleagues I shall be able to answer further 
questions with a suitable degree of exactness. But all this technically is a 
very large program and my requirements as to rigour surpass my ability of 
writing quickly. So many questions will remain and I hope they will be 
settled in the course of further study. If I have a success even today, it 
consists in the fact that these questions are essentially deeper than all 
questions known independently of my program. 

Of course, from the ultra-intuitionistic standpoint the traditional axiom- 
atic set theory ZF is but a poor fragment of our thinking.’ It is to be supple- 
mented by the theory of modality and the whole domain of research of my 
prototheories, and this can essentially be done with the help of my method 
for founding it. But the resulting theory is of course not an axiomatic one. 


15. The ultra-intuitionistic program is also an ultra-pedantic one. In order 
to fulfil its requirements completely it is necessary to construct theories 


* More exactly C is considered as a formal proof of 0 = 0’ in ZF, and Dasa demonstroid 
for 0 = 0’. 

t Also I have to note that ZF is only one way of formalization of Cantor’s set theory and 
therefore its formalization must not close the domain of research in foundations of set 
theory. E.g. one has to investigate the consistency of Quine’s system ‘New Foundations’. 
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about each grammatical category unavoidable in the exposition of this 
program. That is possible, but still not fulfilled in all detail. So today there 
are questions answered only intuitively. There is a kind of intuition un- 
avoidable in this research. To this intuition belongs the understanding of 
aims of each use of a sign or of each other elementary act of thinking. 
I aspire to describe also this intuition by some methods but then new 
elementary acts are required and I am forced to indicate some external 
tactic followed only intuitively. That is unavoidable. But one of the extreme 
directions of the program consists in reduction of all uses of intuition to the 
intuitive understanding that the signs used correspond to their aims. I call 
this direction pragma-ultra-intuitionism. 

At any rate nobody can dispense with a kind of confidence to his own 
memory in his study of a theory. Of course, that is true for my program, too. 
I argue that this kind of confidence is different from faith in the ordinary 
sense, because it is not used as an argument in a proof or deduction. 
Nevertheless the logic of confidence governing this kind of confidence is 
required in my program in order to justify confidence in memory too. This 
logic deals with the transition from ‘A asserts B’ and ‘I believe A’ to ‘I 
accept B’, and with many rules of preference of one source of confidence to — 
another one. I recall that the ethical term ‘fair’ enters in the general explica- 
tion of ‘proof’, and I aim to establish an ethical theorem that it is better 
to accept memory as a source of confidence than to reject it. (If I prefer A 
to B, I say A is better than B; this explication shows why I have always 
to prefer the better; and if sometimes I say I refuse to do it the reason is a 
play on words based on the presence of several tactics of preference.) The 
principle of tautology is considered as the best source of confidence. 

I introduce the extreme directions in order to state that today all unsolved 
questions essentially belong to the extreme directions. So in order to show 
today’s state of affairs in my program I can list these extreme directions. 
They too have been studied by me in part. I can indicate today the following 
seven directions (in their names I use the abbreviation ‘uism’ for ‘ultra- 
intuitionism’ ): 

(1). ultra-ultra-intuitionism (briefly: uduism) — the construction of the 
theory of disputes — the logic of confidence and the relevant parts of ethics; 

(2). extra-ultra-intuitionism (briefly: eduism) or relevantism — the sub- 
stantiation of the hypothesis on obstacles, or its abolishment from our 
considerations; 

(3). trans-ultra-intuitionism (briefly: teduism) — the same for c.o.h.; 
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(4). pragma-ultra-intuitionism (briefly: peduism); 

(5). lega-ultra-intuitionism (briefly: eluism) — the foundations of the 
primary permissions and the deontic relations between different extreme 
directions; the relevant parts of ethics; 

(6). nega-ultra-intuitionism (briefly: neguism) — the substantiation of the 
principle of the negative evidence; 

(7). bi-nega-ultra-intuitionism (briefly: bineguism) — the substantiation 
of the elimination of double negations in the foundations of prototheories. 
Some questions of the same type are still remaining in the foundations of 
the genetic theory, but here there is evidence that they can be settled essen- 
tially by means of Gédel’s imbedding operations. In many cases this elimina- 
tion requires the consideration of the double negation of a possibility as a 
new kind of possibility, for which pmf and other modal principles can be 
proved. Generally to this direction belong only such questions that can be 
dissolved in the considerations of modalities. 


COMPUTABLE ANALYSIS AND DIFFERENTIAL EQUATIONS 


OLIVER ABERTH 


Computable analysis is an analysis restricted to the field of computable num- 
bers, where a real number is called computable if there is an algorithm for 
obtaining arbitrarily precise rational approximations. Algorithms are the 
basis also for the definitions of the functions and sequences of computable 
analysis, and in every instance of a number, function or sequence, an ap- 
propriate defining algorithm is assumed available. This constructive point 
of view leads the analysis to take on many intuitionistic aspects. 


1. Effective methods. A novel feature of this analysis is the wide use that 
can be made of the concept of ‘effective method’. For a given computational 
problem, it is natural to ask whether there is a constructive method of 
finding an algorithm defining the computational result if the algorithms de- 
fining the input data are known. To put this more precisely requires entering 
into the details of what constitutes an algorithm. Here we use the terminolo- 
gy of a recent exposition of computable analysis [1], where the formal con- 
cepts of ‘program’ or ‘programmable function’ replace our undefined term 
‘algorithm’. Np denotes the ‘descriptive integer of the program P’, a unique 
positive integer assigned to the program P. 

There is then an effective method of solving a given computational pro- 
blem if there is a programmable function F(Np,, Np,,---, Np,) such that 
if Np,, Np,.---» Np, are the descriptive integers of programs defining the 
input numbers, functions, or sequences, then F defines the computational 
result. (F may have additional arguments besides the ones shown, in ac- 
cordance with the type of result that F is to define). A computational pro- 
blem for which such a programmable function F exists is then effectively 
soluble; whenever it can be shown that no such F exists, the problem is 
effectively insoluble. 

A simple example of the employment of these concepts is the following 
computational problem: Given a (computable) number x defined by the 
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programmable function a(e) (a(e) is a rational approximation for x with 
error not greater than the rational e), find a decimal approximation d, with k 
decimal digits after the decimal point such that |d,—x|<rx107*, ra 
prescribed rational number. r is usually taken as 4, and often there is an 
additional requirement of the evenness of the last decimal digit when the 
maximum error of $x 10~* is attained. 

Even ignoring the additional requirement, for r = 4 this problem is effec- 
tively insoluble, theorem 14 of {1] providing an easy proof. Since there is no 
difficulty here for the special case when x is a rational number p/g, we can 
show that for r > 4 the problem is effectively soluble: take d, to be a decimal 
number closest to «((r—4) x 107“). 

An interesting convenient choice for ris 3 = 0.5. In this case from d, we 
may obtain d,, 1 < j < kK—-1, by adding 5x 10-4*» to d, and then dropping 
all digits beyond the jth decimal place. Similarly, if r equals 5 x10°~*/ 
(10°—1), 1 < s < k, then the described process is correct for 1 < j < k-s. 

The identification of unreasonable computational problems that is pro- 
vided by the concept of an effectively insoluble problem may be useful in nu- 
merical analysis. 

In the remainder of this paper we treat some topics of differential equations 
from the point of view of computable analysis, to bring out a few differences 
in comparison with real analysis, and other applications of the concept of 
effective method. 


2. The differential equation dy/dx = f(x, y). Our first theorem is similar to 
a standard ‘existence theorem’ of differential equations. 


THEOREM 1. Given the differential equation dy/dx = f(x,y) with f(x, y) 
uniformly continuous in a rectangle R= {(x, y): x [a, bj, yé [c, d]}, with 
f(x, ¥)| < Mand | f(x, ¥1)—f(%, ¥2)| < Lly1—ye2l for points (x, y), (x, V1); 
(x, ¥2) in R and positive constants M, L, then for any prescribed constant yo 
with (a, Yo), (b, ¥o+ M(b—a)) in R, there is an effective method of finding a 
uniformly continuous function y(x), x in [a, b], and its derivative y'(x), such 
that y(a) = Yo, y'(x) =f (x, »(x)). Moreover, this function is unique. 


The usual constructive techniques of Cauchy and Lipschitz may be employ- 
ed in the proof of this theorem to show the effective method and uniqueness 
of y(x) (cf. Ince [4], pp. 75-81). The uniform continuity required for f(x, y) 
cannot be replaced by pointwise continuity. 
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THEOREM 2. Under the same hypotheses as in the previous theorem except 
that the Lipschitz condition is dropped, for any positive integer n there is 
an effective method of finding a function y,(x), x in [a,b], and its derivative 
y,(x) such that y,(a) = yo, |yn(x)—f (x, »,(x))| < 1/n. However, there is an 
J (x, y) such that for appropriate yo there is no effective method of finding a 
function y(x) satisfying y'(x) = f(x, y(x)), ya) = Yo- 

Proor. There is no difficulty in constructing a uniformly continuous 
function f,(x, y) such that | f,(x, y)—/(x, y)| < 1/n in R and f,(x, y) satisfies 
the Lipschitz condition for L=L,. Then by theorem | we may find y,(x) with 


[ya(x)—F(%> Yul) = (Fn Ynlx) —F% Ya(%))] < Ln. 


To prove the second part of the theorem, consider the function f(x, y)=3y?, 
with [a, b] = [0,1]. If yo > 0, y = (x+|yol*)? so that y(1) > 1, and if 
Yo < 0, y = —(x+ yol*)? so that y(1) < —1. That these functions are 
unique may be shown by applying theorem 1 for an appropriate rectangle 
R. If yo = 0 any of the following functions is a solution: y = 0 for xin 
[0, x9], »y = £(x—xp)* for x in [x9, 1], 0 < x9 < 1. 

If there were an effective method of obtaining a solution y(x; yo) for any 
Yo in an interval containing 0 as an interior point, say [—1, 1], then we 
arrive at a contradiction. Define the programmable function Q(z) as follows: 
For n = Np, set Yo = ap (cf. proof of theorem 21 of [1] for the definition 
of ap) and decide on one of the following as true: y(1; ap) > —4 or y(1; ap) 
<4. In the first case Q(n) = +1, in the second Q(n) = —1. Thus if 
P(Np) is defined, Q(Np) = +1 if P(Np) < 0 and —1 if P(Np) > 0. But 
then if Ng is the descriptive integer of a program realizing Q, whichever 
value Q(Ng) assumes leads to a contradiction. 


3. Linear differential equations with constant coefficients. To obtain an 
effective method of solving the equation y+a,_,y"" P+ ... tayy’+ 
+aoy = 0, a; constants, with the initial conditions that y(0) are specified, 
k =0,1,...,n—1, a slight change from the usual approach is necessary. If 
risa complex root of multiplicity m for the polynomial 1"+a,_,f°°'+ ...+ 
+a, then ze”, 7 = 0,1,..., m-—l1, are m linearly independent complex 
solutions to the differential equation. We apparently have an effective 
method of solving our problem: Since a general solution is )}-, ¢y,(z), 
y; a complete set of the above solutions, we may adjust the constants c; to 
satisfy the initial conditions and then take the rea! part as our solution. 
However, the method of obtaining the specific solutions y; is flawed be- 
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cause there is no effective method of determining whether a polynomial 
with computable number coefficients and degree > 1 has multiple roots, or 
even the multiplicity of a given root [2]. To get around this difficulty, note 
that if r,,r, are distinct, the divided difference 


is also a solution. Using this function, e"!”, and further divided differences> 
we obtain v linearly independent solutions given by 


00 zi 
= : stato i 
ys(z) a E FLO » Myo g0 ...Ps*s 
k=s-ik!  i;20 
ign¢+... tis= 
=k-st1 


fors = 1, 2,...,”, where the roots 7,, 75, 2... f, need no longer be distiner 
These functions can be used to obtain the effective method of solving the 
differential equation under the initial conditions. 


4. A solution to the equation 4u = 0 with no analogue in real analysis. - 
Since in computable analysis there are functions pointwise continuous over 
a closed interval but not uniformly continuous there, the possibility arises 
of finding such functions as solutions to differential equations. This could 
not occur with the linear differential equation treated in the previous sec- 
tion; a generalization of theorem 1 applies there. However, for Laplace’s 
differential equation 
Ou. ou tee 7 ou | one 
1a (ayy oe 
r r° 00 


or 
we can show that non-uniformly continuous solutions exist. 


THEOREM 3. There is a function u(r, 9), not identically zero on the unit disc 
0 <r<_1 though equal to zero on its circumference, pointwise continuous 
but not uniformly continuous on the disc, and satisfying the equation Au = 0 
inside the disc. 

Proor. We may enumerate those descriptive integers Np such that 
P(2~ “r*?)) is defined: N,, N2,.... If the correspondingly defined rational 
numbers are r,,r2,..., define g,(@) to be a periodic function, with period 
2n, such that g,(@) = 0 for |@—r,| < 27%"*, ¢,(0) = 1 for (@—r,| = 27" 
and —z < 0—r, < a, and g,(9) is given by the obvious linear relations for 
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intermediate 0. If h,(0) = []i-1 9,(0), then in a period 27 this function is 
unequal to 1 on intervals whose total length is less than 257, 27* = 2. 
Now if 


1 2n 22 
a(k), =| h(@)cos kode, bk), = + | h,(0) sin k0d0, 
To TJo 
then these sequences are Cauchy and hence have limiting values a(k), b(k). 
For r < 1 we define our desired function by 


nF, 0) = 4a(0)+ r* (a(k) cos kO + b(k) sin k6). 


By differentiation of the series, u may be shown to satisfy du = Oforr < 1. 
If u,(r, 0) is given by 


u,(r, 8) = 4a(0),+ Y r*(a(k), cos k0+b(k), sin k0), or <1, 
k=1 
an alternate form for u,(r, 0) is the Poisson integral (cf. [5], pp. 270-277) 


ie ae 1—r? 
urs 0) = | Ho) 


—_—________—— dg, r<l. 
r?—2r cos (0—@) +1 ss 


For any angle 0’ = {a(e)}, if N, is a descriptive integer of a program re- 
alizing a, «(27 ‘%=*7)) is defined, so we may find a large enough n, say no, 
such that /,,(@) = 0 on an interval containing @’ as an interior point. Em- 
ploying the Poisson integral, for any positive number e we can show in the 
usual manner that 0 < u,,(r, 6) < e for (r, 6) inside the unit disc and suffi- 
ciently close to the point (1, @) on the circumference. The same applies to 
u,(r, 0), m > no, by comparison of their Poisson integrals with that of 
Ung(X, ¥), and by an easy additional argument also to u(r, 0). Thus we may 
set u(r, 0) equal to 0 on the circumference of the disc and obtain a compu- 
table function defined everywhere on the disc. 


5. Effective methods and well posed problems. In real analysis, to make the 
problems of partial differential equations conform more with physical 
reality, the concept of a well posed problem, where the solution varies 
continuously with the initial values, is widely employed. Hadamard’s 
famous example of a problem not well posed was that of determining a 
solution to Au=0 with u(x, 0), u,(x, 0) prescribed. The functions 
u,(x, y) = (sinh ny sin nx)/n? satisfy the initial conditions u(x, 0) = 0, 
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u(x, 0) = (sin nx)/n, but as n - 00, u,(x, y) does not approach the solution 
u(x, y) =0 though the initial conditions approach u(x, 0) = 0, u,(x, 0) = 0. 

In computable analysis the concept of an effectively soluble problem 
may be used as a substitute for the idea of a well posed problem. For in- 
stance, it is not difficult to show that for functions g(x) not unequal to all 
the functions 0, (sin nx)/n, there is no effective method of finding a function 
u(x, y) not unequal to all the functions 0, (sinh ny sin nx)/n?, with 
Aes 0) =U ves) = ces), 

A formal proof can be given as follows: Take g(x) equal to |ap| sin (x/|ap|). 
|ap| is a reciprocal of a positive integer or is 0 according as P(Np) is or is not 
defined. If |ap] = 0, x/|ap| is undefined, but in this case the coefficient 
|ap| allows us to take g(x) = 0 and obtain a computable function. If there 
were an effective method of obtaining u(x, y), then since u(I, |ap|) > 4 if 
P(Np) is defined and u(1, |ap|) = 0 if P(Np) is undefined, we obtain an ef- 
fective method of determining whether or not P(Np) is defined, contradicting 
theorem 13 of [1]. 
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MATHEMATICS AS A NUMERICAL LANGUAGE 


ERRETT BISHOP 


The one point on which constructivists agree is their criticism of classical 
mathematics. Brouwer’s great contribution was to analyze intensively the 
inadequacies of that system. After Brouwer and Kronecker, we all know a 
reformation is needed, but disagree about what course it should take. 
Intuitionism, as developed by Brouwer, stresses as basic our intuition of 
the integers and our intuition of the real numbers; all of mathematics is to be 
reduced to these two primitive constructs. In my book [1] I proposed, 
in the spirit of Kronecker rather than Brouwer, that the integers are the 
only irreducible mathematical construct. This is not an arbitrary restriction, 
but follows from the basic constructivist goal - that mathematics concern 
itself with the precise description of finitely performable abstract operations. 
It is an empirical fact that all such operations reduce to operations with 
the integers. There is no reason mathematics should not concern itself with 
finitely performable abstract operations of other kinds, in the event that 
such are ever discovered; our insistence on the primacy of the integers is 
not absolute. 

Thus by ‘constructive’ I shall mean a mathematics that describes or 
predicts the results of certain finitely performable, albeit hypothetical, 
computations within the set of integers. If a word is needed to delimit this 
special variety of constructivism, I propose the term ‘predictive’. From 
the predictive point of view, Brouwer’s intuitionism at first glance contains 
elements that are extremely dubious; free choice sequences and allied 
concepts admit no ready numerical interpretation. Moreover, the numerical 
content of intuitionistic mathematics is diluted by over-reliance on 
negativistic techniques. The role of negation in predictive mathematics is 
philosophically secure, if only because there exist negative statements that 
do have numerical content. Nevertheless, it is remarkable that a systematic 
effort to avoid negation leads uniformly to better results. 

Constructive mathematics is in its infancy. According to some, it is 
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doomed to the role of scavenger. These people conceive of classical mathe- 
matics as establishing the grand design and the imaginative insight, leaving 
the constructivists to add whatever embellishments their credos demand. 
Although totally wrong, this viewpoint hints at a truth: The most urgent 
task of the constructivist is to give predictive embodiment to the ideas and 
techniques of classical mathematics. Classical mathematics is not totally 
divorced from reality. On the contrary, most of it has a strongly constructive 
cast. Much of the constructivization of classical mathematics is therefore 
routine; constructive versions of many standard results are readily at hand. 
This makes it easy to miss the point, whichis not to find a constructive version 
of this or that, or even of every, classical result. The point is not even to 
find elegant substitutes for whole classical theories. The point rather is 
to use classical mathematics, at least initially, as a guide. Much will be of 
little value to the constructivist, much will be constructive per se, and much 
will raise fundamental questions which classically are trivial or perhaps do 
not even make sense. The emphasis will be on the discovery of useful and 
incisive numerical information. It is the incisiveness and scope of the 
information, not the elegance of the format, that is relevant. : 

A given classical result may have no, one, or many constructive versions, 
none necessarily superior to the rest, because different constructive theorems 
can represent different numerical aspects of the same classical result, all 
giving different estimates, useful for different ends. In many instances a 
classical definition, which makes good constructive sense, no longer rep- 
resents the correct point of view, and so must be replaced. Finding the 
correct replacement is often a non-trivial challenge, involving considerations 
which from the classical standpoint would be absurd. I believe that even- 
tually the influence of constructive on classical mathematics will be greater 
than the influence the other way. Very possibly classical mathematics will 
cease to exist as an independent discipline. In the meantime, it behooves 
constructivists to attach their mathematical and philosophical investigations 
to mathematics as it exists. Contrary practice has led to numerous irrelevan- 
cies and misplaced emphasis. Even a quick look shows that much of the 
constructivist literature lacks the serious intent the subject demands. 

In this short paper I want to indicate what to me are some of the important 
questions of constructive mathematics today. First, I wish to discuss certain 
mathematical problems, not because of any special interest or difficulty of 
these particular problems, but as an indication of the sort of work that 
needs to be done. 
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The first example is taken from probability theory. (Modern probabilists 
seem to have little or no interest in the computation or even the computa- 
bility of the probabilities with which they deal. It is not surprising the 
subject is blatantly idealistic.) The Birkhoff ergodic theorem asserts that if 
T is a measure-preserving transformation on a finite measure space and f 
is an integrable function, then the averages 


fo) = FO) 4/(TH+ ... 4f(T"D) 
n+1 


converge as n — 00 for almost all x. Constructively this result fails. For a 
counterexample in the style of Brouwer, take T to be rotation of the unit 
circle through an angle 2na, where « is a real number which, for all we know, 
could be equal to 0, but could, on the other hand, be irrational. On the one 
hand {f"} would converge a.e. to f, and on the other to the constant function 
(2x)~*{ f(@)d@. Thus a constructive proof of Birkhoff’s theorem is out of 
the question. It is shown in [1] that the sequence {f"} in general satisfies 
certain inequalities (of a type first introduced by Doob for the study of 
martingales, and called upcrossing inequalities), which classically imply 
the sequence converges a.e. These inequalities, which from the classical 
point of view constitute a considerable strengthening of Birkhoff’s theorem 
and its principal modern generalizations, would seem to afford a satisfactory 
constructive version of the ergodic theorem, but this is not so. In the case 
of a completely general measure-preserving transformation, the upcrossing 
inequalities are probably satisfactory. In other words, they afford a good 
equal-hypothesis substitute for Birkhoff’s theorem. However, we would 
also like a good equal-conclusion substitute for Birkhoff’s theorem — that is, 
usable conditions on T that imply the constructive convergence of the 
sequence {f”} almost everywhere. This is an important open problem. 

The next example is taken from algebra. Recently I was asked whether 
the Hilbert basis theorem - that a polynomial ring over a Noetherian 
domain is Noetherian - is constructively valid. The answer is easily seen 
to be ‘yes’. Unfortunately, not even the ring of integers is Noetherian from 
the constructive point of view (and therefore the Hilbert basis theorem is 
vacuous). For a counterexample in the style of Brouwer, let {n,} be a 
sequence of integers, for which we are in doubt as to whether they are all 
equal to 0. The ideal generated by the integers m, has no finite basis in the 
constructive sense. The problem is to find a constructively usable reformula- 
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tion of the definition of a Noetherian ring, which would include the integers 
and give constructive substance to the Hilbert basis theorem. 

Our third example, from topology, came as a surprise. Elementary 
algebraic topology should be constructive, but the definition of the singular 
cohomology groups gives trouble. A singular 1-simplex of the unit circle 
S! is a continuous function w from the closed unit interval [0, 1] into S*. 
Let Q denote the set of all such w. A singular 1-cochain c (over the integers 
Z) can be thought of as a function from Q to Z. We wish to define con- 
structively a singular 1-cochain that generates the one-dimensional cohomo- 
logy group of S'. Such a cochain will in particular be a non-constant 
integer-valued function c on 2. Now the set 2 is arcwise connected, in the 
sense that if w, and w, are any points of Q, there exists 2: [0, 1] — Q with 
A(0) = w, and A(1) = @,. A result of Brouwer says that every integer- 
valued function on [0, 1] is constant. A corollary is that every integer-valued 
function on an arcwise connected set is constant. Thus a non-constant 
integer-valued function c on Q would counterexample Brouwer’s result. 
Brouwer’s result has not been counterexampled. These considerations 
indicate the difficulties involved in finding a satisfactory constructive version 
of singular cohomology theory. 

Each of the three problems just discussed requires the development of 
new concepts appropriate to the constructive point of view. None of them 
is likely to be given an acceptable solution by the application of a general 
technique of constructivization. Incisive estimates and apt definitions are 
not to be expected as consequences of general schemes that translate from 
classical to constructive mathematics, although translation techniques may 
have value in special instances, as we shall see later. 

The most urgent foundational problem of constructive mathematics con- 
cerns the numerical meaning of implication. Constructivists have custom- 
arily accepted Brouwer’s definitions of the mathematical connectives and 
quantifiers, implication in particular. According to Brouwer, P + Q means 
that the existence of a proof of P necessarily entails the existence of a proof of 
Q, in other words, there is a method that converts a proof of P into a 
proof of Q. In [1] I gave a variant of this definition, fitted to the predictive 
point of view: P + Q means ‘... the validity of the computational facts 
implicit in the statement P must insure the validity of the computational 
facts implicit in the statement Q ...’. There is a discrepancy between 
even this reformulated definition and the predictive goal, since, as defined, 
P —> Qis not a prion predictive of the results of certain finitely performable 
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computations within the set of integers. Rather than prematurely attempt 
to resolve this discrepancy, in [1] I decided to let the mathematics be the 
test, and found that in actual practice there was little difficulty in giving 
numerical interpretations to statements with implications or even nested 
implications. Although the numerical meaning of implication is a priori 
unclear, in each particular instance the meaning is clear. We are at liberty 
to continue to treat the numerical meaning of implication as being provided 
by the context, but hopefully there exists a philosophical explanation of the 
empirical fact that intuitionistic implication in each instance admits a 
numerical interpretation. Such an explanation requires a deeper analysis 
of the content of a theorem of constructive mathematics. As a point of 
departure for such an analysis, I examined a number of theorems and proofs 
of [1}, and came to the following conclusions. 

A complete mathematical statement — that is, a theorem conjoined with 
its proof and with all theorems, proofs, and definitions on which it depends, 
either directly or indirectly — asserts that a given constructively defined 
function f, from a given constructively defined set S to the integers, vanishes 
identically. In other words, it asserts VxA(x), where A is the decidable 
predicate f(x) = 0 and x ranges over S. 

Most theorems, standing by themselves, are incomplete mathematical 
statements. An incomplete mathematical statement concerns certain entities 
whose constructions are not described in the statement itself. For instance, 
the prime number theorem 


| 
lim ZAM logn _ 
noo n 
implicitly refers to a sequence {n,} of positive integers such that 


n(n) logn _ 


Mick 


n 


whenever n > n,. The integers n, can be extracted from the proof of the 
prime number theorem, presuming it is constructive (as it is). In general, 
an incomplete mathematical statement asserts the existence of an element 
y of a set 7, such that when the rule for constructing y is given the statement 
becomes complete. In other words an incomplete statement has the form 
dyP(y), where P(y) is a complete statement with parameter y. According 
to the above, the incomplete statement therefore has the form 


SyVxA(x, y) (1) 
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where x runs over a set S, y runs over a set T, and A(x, y) is a decidable 
predicate for each x in S and y in T. 

An incomplete mathematical statement is a consequence of its completion. 
Nevertheless, it contains additional information — the intent. If the prime 
number theorem were written in the form 


n(n) log n 


VnVk (» >i —1i 


< K) 

n 

and n, replaced by its explicit definition, we might fail to realize that the 
primary intent was to construct any sequence {n,} making the statement 
true. The use of incomplete statements aids the intuition and saves time. 
Moreover, incomplete statements are structural components of implications. 
An implication P > Q, where P and Q are incomplete statements, means 
something quite different from P’ > Q’, where P’ is a completion of P and 
QO’ of Q. 

Let us grant that mathematics is concerned with statements of the form 
(1), and inquire how an implication P - Q, where P and Q are statements 
of form (1), is to be interpreted as a statement of form (1). The interpretation 
we shall develop is due to Gédel [2]. Let P be 4yVxA(x, y) and Q be 
dvVu B(u, v), so that P > Q is 


JyVx A(x, y) > JoVu B(u, v). (@)) 
Pure thought translates this into 
Vy(Vx A(x, y) > doVu B(u, v)). (3) 


For a given value of y, which we now fix, (3) asserts that in case the statement 
A(x, y) is true for all x then v can be constructed to have certain properties. 
The rule for constructing v will consist of a certain finitely describable 
procedure, some stages of which perhaps assume the truth of Vx A(x, y). 
For example, at a certain stage we may need to know that a certain integer 
d is non-zero, in order to be able to perform a division, and it may be 
necessary to make use of the hypothesis Vx A(x, y) to derive the needed 
inequality d ¢ 0. In fact, such an occurrence is no obstacle to giving a 
universally valid definition of v. We simply take v to be some convenient 
constant in case d = 0 (which constant does not matter). Such considera- 
tions lead us to expect we can extend out original construction, and give a 
universal construction for an element v of the set in question, with the 
property that in case Vx A(x, y) does hold, then v will have the value 
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originally prescribed. In other words, we conjecture that if we can prove the 
statement 
Vx A(x, y) > JvVu B(u, v), (4) 


we can actually prove the seemingly stronger statement 

do(Vx A(x, y) > Vu B(u, v)), (5) 
which in turn is equivalent to 

JoVu(Vx A(x, y) > B(u, v)). (6) 


Thus we conjecture that P > Q is actually equivalent to the seemingly 
stronger statement 


Vydvvu(Vx A(x, y) > Bu, v)). (7) 
To go deeper, fix values of y, v and u, and consider the statement 
Vx A(x, y) > B(u, v) (8) 


which asserts that the particular decidable statement B(u, v) is a necessary 
consequence of the totality of decidable statements A(x, y) for all x. 
Experience indicates that a proof of a statement such as (8) actually deduces 
the truth of B(u, v) from the truth of finitely many of the statements A(x, y). 
Moreover, reflection indicates the difficulty of exhibiting an instance of a 
proof of a statement such as (8) that actually uses infinitely many of the 
statements A(x, y) to prove the statement B(u, v). Let us therefore conjecture 
that if we can prove (8) we can actually construct elements x,,..., x, 
such that 

A(x,, y)A..-AA(X,5 y) > Blu, v). (9) 


Now (9) is a finitary statement involving decidable propositions. Therefore 
the rules of classical logic hold, so that there exists k with | <k«<n 
such that 


A(x, y) > B(u, v). (10) 
Thus we conjecture that if we can prove (8) we can prove 
Jx(A(x, y) > B(u, v)). (11) 


Thus we conjecture that (7) is actually equivalent to the seemingly stronger 
statement 
Vydovudx(A(x, y) > B(u, v)). (12) 
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Using the axiom of choice, we transform (12) into 


q0AxV yVu(A(X(y, u), vy) > Blu, o(y))). (13) 


This is our candidate for a numerical version of P — Q, and these con- 
siderations probably explain why intuitionistic implication in actual practice 
admits of numerical interpretation. Gédel in [2] proves for a particular 
formal system, designed to accommodate large portions of constructive 
mathematics, that any proof of (2) can be transformed into a proof of (13). 
Whether or not we wish to commit ourselves to this or any other formal 
system, Gédel’s result strengthens our conjecture that (13) is the proper 
numerical version of P — Q, and leads us to define a new type of implication, 
which I shall call numerical implication (or Gédel implication). With P and 
Q as above, P > Q is defined to be the statement (13). Since it appears that 
intuitionistic implication in practice amounts to numerical implication, it 
would seem that for the philosophical unity of predictive mathematics we 
should abandon intuitionistic implication and work with numerical 
implication exclusively. Before we definitely accept such a change, we should 
check in more detail that numerical implication is actually being used, and, 
just as important, that our intuitions can adjust to the change. I believe 
experience will prove that numerical implication is at least as natural and 
easy to uSe as intuitionistic implication. 

Another important foundational problem is to find a formal system that 
will efficiently express existing predictive mathematics. I think we should 
keep the formalism as primitive as possible, starting with a minimal system 
and enlarging it only if the enlargement serves a genuine mathematical need. 
In this way the formalism and the mathematics will hopefully interact to 
the advantage of both. As a point of departure, we take a formal system 
such as used by Gédel [2]. Another version is given in Spector [6], where 
the relevant system is called 2,. It is closely related to the system of Kleene 
and Vesley [3], with the free-choice type axioms left out. 

To give a quick sketch, our system 2 formalizes the theory of functions 
of certain types. The types are defined inductively as follows. The primitive 
type is [0], and a function of type [0] is an integer. If t,,...,t, are types, 
t = (t,,...,t,) is a type, and a function of type t is an n-tuple (f;,...,f,), 
where f; is of type t;. If t; and t, are types, t; — t, is a type, and afunction 
of type t; +t, is a function from the set of all functions of type t, into 
the set of all functions of type t,. The types t, > (t, > t3) and (t,,t,) > t3 
are considered to be the same, and the types t > (t,,...,¢,) and 
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(t + t,,...,t > t,) are considered to be the same. The system ZY contains 
variables of the various types, for functions of the various types. Variables of 
the various types are combined in meaningful ways to form terms. A 
variable of type t is also a term of type t. If wis a term of type t, > t, and 
b a term of type t,, then u(b) is a term of type t,. The constant 0 is a term 
of type [0]. For each term u of type [0], the successor uw’ of u is a term of 
type [0]. If u is a term of type t and x,,...,x, are distinct variables of 
atpittacy types @fy,4.-,¢ then A(x; ,x,)U is a term of type 
(t,,...,t,) — t, to be interpreted as the function whose value at (/,,...,/,) 
is the result of replacing x, by f; in u (1 <i <n). The primitive formulas 
of 2 are of the form u = b, where u and D are terms of type [0]. By means 
of the connectives A, v and —, and the quantifiers J and V, applied to 
primitive formulas, arbitrary formulas are obtained. The statement ‘not A’ 
is defined to mean ‘A — 0 = 1.’ The axioms and rules of inference include 
the axioms and rules of the intuitionistic predicate calculus (rules and 
axioms Al through Al0 and BI through B4 of [6]), axioms for equality 
(axioms Cl through C4 of [6]), the induction rule (rule D of [6]), and the 
axiom of choice for all types (an extension of axiom E of [6]). Also included 
is an axiom expressing the meaning of the j-operator. For convenience 
we might also include axioms for certain functions, as is done in [3]. 
Our first problem is to interpret sets in the system 2. To each set A we 
associate a formula A’(x, y) containing no quantifiers. (Then A’(x, y) is 
decidable for given values of the free variables x and y.) The set A is defined 
by taking xe A to mean VyA'(x, y). (Of course, x may stand for a finite 
sequence (x,,...,X,) of variables of various types, and the same is true 
of y. Note that it would be incorrect to define x € A by a formula of the 
type jz Vy A'(x, y, z); in such a situation, the value of zis necessary to com- 
pletely determine an element of A; hence we should write (x, z)¢ A rather 
than x € A, and define it by the formula VyA’(x, y, z).) Each set A has a 
relation of equality, which means we must define x, =,x, by a formula 


Bevel (oct Xa, V2). 


Of course =, must be shown to be an equivalence relation. The special 
equality x, = x, defined as Vy(x,(y) = x2(y)) is called functional equality. 
Whenever the equality relation on a set A is not defined, functional equality 
will be meant. In general we require the equality relation on any set to be 
weaker than functional equality. If A’ and A’’ contain a variable u, in 
addition to those described above, we have a family of sets indexed by the 
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parameter u. This construction will only be used in the special case of a 
family of subsets, as described below. If A and B are any sets, defined 
respectively by formulas 4’, A” and B’, B’, we define the set C = F(A, B) 
of all functions from A to B as follows. Write the formula 


Yx(xe A > f(x) € B)AVx,Vx2(x, EA AX, EAAX, =4 X2 Of (1) =e f(X2)) 


in the form JuVvC'(f, u, v). Take (f, u) € C to mean WoC'(f, u, v). Similarly 
take Gis u) =c(f; uz) to mean 


Vx(xe A > f(x) =p f2(x)). 


To define a subset B of a set A, according to [1], we must define an element 
x of A, perform the construction of an additional function u, and check 
that certain additional conditions are satisfied. Thus B will be determined 
by a certain formula B’, and (x, wu) € B will mean 


x € AAWB'(x, u, v). 


We take (x,,u,) =5(X2,u2) to mean x, =, xX, . In case B’ contains a 
variable w in addition to x, uw and v, we obtain a family of subsets of A. 
The union and the intersection of a family of subsets of A are defined in 
obvious ways. In case B, and B, are subsets of A, the formula B, c B, 
is defined in an obvious way, and B, = B, is by definition the formula 
B, < B,AB, < B,. The statement B, ¢ B,AB, < By, > B, < B;, for 
instance, is a formula in 2 containing as subformulas the formulas 4’, 
A”, Bj, B; and B3. (This particular statement is of course provable in 2.) 

There is no difficulty in extending the above ideas to complemented sets. 
(A complemented set, relative to a family F of real-valued functions on a 
set A, is an ordered pair (B,, B,) of subsets of A such that for all xe B, 
and y e B, there exists fin F with f(x) # f(yv).) We first run into difficulty 
with Borel sets. In [1] we consider a set A, a family # of real-valued 
functions on A, and a family .4 of complemented subsets of A relative to 
F; we define a Borel set generated by -@ to be a complemented subset 
obtainable inductively from the two following techniques of construction: 

(1). The elements of -@ are Borel sets. 

(2). A countable union (or countable intersection) of Borel sets already 

constructed is a Borel set. 

Properties valid for an arbitrary Borel set are often proved by induction, 
corresponding to the inductive character of the definition just given. The 
type of induction in question also occurs in Brouwer’s definition of the 


A Ill MATHEMATICS AS A NUMERICAL LANGUAGE 63 


constructive ordinals. These definitions seem not to be formalizable in 2. 
To extend the system 2 to subsume the theory of Borel sets, it would be 
necessary to include the type of induction in question. W. A. Howard tells 
me he has constructed such an extension. We shall consider this matter 
no further, since there is another approach, which will be explained later. 

It appears that the theory of the standard abstract structures — groups, 
metric spaces, differentiable manifolds, and so forth, can be developed 
within 2. For example, a metric space (X, p) can be realized in Y by the 
formulas X’, X" determining the set X, by the element (p, v) of the set 
OC OG Ro). and by an element w that arises when the formula 


XeXnvexXaApx, y)=0—>x =, y 


is translated to the form JwVrP(w,t,p)=0. We shall abbreviate 
M = (X'", X", p, v, w), and introduce the notation Me. #&T to represent 
the fact that X’ and X” are formulas defining a set , that p is a metric 
on X, that v and w are the moduli introduced above, and that a certain 
formula Vy.#'(p, v, w, y), which contains X’ and X” as subformulas and 
expresses the fact that p is a metric on Y and v and w are the moduli de- 
scribed, is valid. As a formula of 2, Me W&Z stands for Vy.@'(p, v, w, y). 
We might call 4&7 a large set (or a class), as distinguished from the sets 
already defined (which we sometimes call small sets). 

Although the metric spaces form a large set, rather than a small set, 
the compact (metric) spaces can be regarded as a small set, as follows. 
Let T be the set of totally bounded pseudo-metrics on the integers Z. For 
each p in 7, let Z, be the completion of Z with respect to p. The set of all 
such completions is the set of all compact spaces. 

In the same way, there is a small set of locally compact spaces. 

A (small) category C is an analog of a small set. The set ¥ of objects of 
C is specified by a formula X’, and the equality relation on X is taken to be 
functional equality. Another formula Y’ defines the sets Hom (x,, x) 
of morphisms, so that z ¢ Hom (x,, x2) means Vy Y’(x,, X2, 2, y). A third 
formula gives the equality relations on the sets Hom (x,, x2). There is a 
function e which to each x in X assigns an element (the identity element) 
e, of Hom (x, x). There is a function which to each z, e Hom (x,, x2) 
and z, e Hom (x,, x3) assigns an element (their product) of Hom (x,, x3). 
The usual axioms for a category must be satisfied. 

Of course, all metric spaces, or all groups, or all vector spaces over R 
cannot be regarded as a small category. However, if a suitable cardinality 
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restriction — such as separability or countability — is imposed on the indivi- 
dual objects, many classical categories can be considered as small categories 
in our sense. Examples are the compact (metric) spaces, the locally compact 
(metric) spaces, the countable metric spaces, the (separable) Banach spaces, 
the countable groups, the locally compact (metric) groups, the (metric) 
differentiable manifolds, and the countably generated vector spaces over R. 
Functors are easily treated, and a suitable framework for homological 
algebra is thereby provided. For example, one can formalize the definition 
of the singular homology functor from the small category of locally 
compact spaces to a certain small category of abelian groups, and 
presumably derive in 2 the standard properties of singular homology. 

Following the procedure we used for defining a large set, we can define 
a large category. For example, to make “&Z into a large category, 
we must first define the set Hom (M,, M,) of morphisms connecting given 
objects M,e Wé&ZFZ and M,e WET of METZ. Again the membership 
relation z€ Hom (M,, M,) is defined by a formula Vy.#"'(M,, M2, y, Zz), 
where our notation indicates that .#”’ is a formula with variables p,, v,, 
Wis P2202, W2, y, Z containing X,, X;', X; and X4’ as subformulas. Another 
formula will give the equality relation on Hom (M,, M,). Again the func- 
tion e which assigns to each Me. MET an identity element of Hom (M, M) 
and the function describing the product of morphisms must be given. The 
structure of these functions needs further elucidation. They will not be fixed 
functions of the system 2, because their types will depend on the types of 
their arguments, which are not fixed. Presumably their definitions will have 
the same form independently of type. 

Continuing along the above lines, we should have no difficulty in defining 
functors between large categories. Whether the theory would be useful in 
constructive mathematics is not clear. On the one hand, it is possible that 
small categories are adequate for the applications, but it is also possible 
that something more general than a large category might be needed, to 
define which we would need to enlarge the system 2. 

Consider a positive measure fp on a compact space X, that is, a non- 
negative linear functional f > {fdu on the space C(X) of all continuous 
real-valued functions on Y. We wish to define the measures of certain 
subsets of Y. It would be extremely awkward to attempt to formalize the 
theory as given in [1] in the system 2, because there is no set of Borel sets 
and therefore no set of measurable sets in 2. Thus we must either extend 2 
or take another approach. It turns out that a modification of the approach 
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of [1] gives a theory that is not only formalizable in XY but improves the 
original version. The idea is to define a partial set S to bea triple (f, {f;}, M), 
where fe C(X),0 < f < 1, f;¢C(X) and 0 < f; for each positive integer j, 
and M is a positive integer, such that 


(a). w(S) =lim | f;dy exists, 
jivmo 


(b). fy > min {f, esi, 
(c). fi < fa <. 


Then we define x e S to mean that there exists 6 > 0 with f(x) > f,,(x)+6 
for all positive integers m, and x e ~ S to mean there exists 6 > 0 with 
1—f(x) > f,,(x)+6 for all m. Consider xe S and ye ~ S. Now either 
fu(x) > f(x) or f(x) = 1-f(x), by (b). Since the former inequality 
contradicts x eS, we have in fact fy(x) 2 1—f(x). Hence 1—/(x) </(x). 
Similarly f(y) < 1—f(y). Hence x # y. Write S’ = {x:xe~ S} and 
S” = {x: xe ~ S}. The ordered pair 


a(S) = (S’, S””) 


is a complemented set in the sense of [1], which means that if x eS’ and 
yeS” then x 4 y. The complement of S is defined to be the triple 
~ 5 = (1-f, {fj}, M). Clearly a(~ S)= ((~ SY, (~ SY’) = (8", 8’), 
which by definition (see [1]) is —(S’, S”) = —a(S). 

Now if T = (9, {g9;}, N) is a second partial set, we define S < T to mean 
that for each positive integer m there exists a positive integer y = y(m) 
such that |f—g] < f,—9,. Clearly, S < Tif and only if ~ S < ~ T. Also 
S < U whenever S < T and T < U, and x eT whenever xe S and S < T. 
Hence S < Timplies that «(S) < «(T), in the sense that x e a(S) + x €a(T) 
and xe —a(S) + xe —a(T). 

For the partial set S considered above, write [S = |fdy. For the partial 
sets S and T given above, S < T implies 


f5-fr|-| foe fo 


The union Sv T of the partial sets S and T is 
SvT = (max {f, 9}, {f;+9;}f21,5 M+N). 


< i Fane 


< | (f,-Im)dp < { f,du < o(S). 
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It is easily seen that Sv T is a partial set, with o(SvT) = w(S)+oa(T). 
Similarly, SA T is defined to be 


(min {f, 9}, {fi t+9j}7-1,M+N). 
It is a partial set. We have 
~(SVT)=~SA~T and ~(SAT)= ~Sv ~T. 
hi ee Ss aad then 


SivSe-< 7,v7T, and S,AS, 24,4 i. 
Also 
a(SvT) <a(S)Ua(T) and a(SAT) < a(S) + a(T). 


A measurable set S={S(n)}y~1 is a sequence S(1) < S(2) <... of. 
partial sets, with 


S(n) = (F (1°), Fim )}j=1 > M(), 
such that lim,..,, o(S(”)) = 0. The limit 


p(s) = tim | f(a, ae, 


called the measure of S, exists. If T is another measurable set, we define 
(SvT)(n) = S(n)vT(n) and (SAT)(n) = S(n)AT(n) for each n. Also 
(~ S)(n) = ~ S(n). It follows that Sv T, SAT and ~ S are measurable 
sets, and the usual algebraic laws are valid. 

For each measurable set S, we take x e S to mean x € S(n) for some a, 
and xe ~ S to mean xe ~ S(n) for all n. Correspondingly we write 
a(S) = (Jr, a(S(n)). Then a(~ S) = —a(S). 

If the measure y(S) of the measurable set S is positive, then [f(n, -)du > 
> @(S,) for some n, and a construction similar to that of [1] gives a point 
x € S(n). Hence x € S, so that a(S) is non-void. 

If {S(-, k)}y2-1 is a sequence of measurable sets, such that 

C = lim n(S(5 1) Vey Sk) 
exists, a somewhat complicated definition basically similar to that given 
in [1], leads to a measurable set S = V ,u, S(-, k) having the following 
properties: 


(1). n(S) = C 
(2). a(S) < Ures a(S(-, k)). 
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An operation /\ is defined similarly. The theory of chapter 6 of [1] can be 
developed in this framework. 

To formalize in 2 the notion of an abstract measure space, definition 1 
of chapter 7 of [1] must be rewritten as follows. A measure space is a family 
M = {A,\,e7 of complemented subsets of a set X relative to a certain 
family F of real-valued functions on X, a map p: T — R°*, and an addi- 
tional structure as follows: The void set @ is an element 4,, of 4, and 
H(to) = 0. If s and ¢ are in T, there exists an element svt of T such that 
Ayy; < A, U A,. Similarly, there exist operations A and ~ on T, corre- 
sponding to the set-theoretic operations ™ and —. The usual algebraic 
axioms are assumed, such as ~ (svt)= ~sA~t. Certain measure- 
theoretic axioms, such as p(svt)+ p(s At) = p(s)+p(t), are also assumed. 
Finally, there exist operations V and /\. If, for example, {t,} is a sequence 
such that C = lim, u(t, V...V4,) exists, then V {t,} is an element of 
T with measure C. Certain axioms for V and A are assumed. If T is the 
family of measurable sets of a compact space relative to a measure p, and 
the set-theoretic function py: T— R°* and the associated operations are 
defined as indicated above, the result is a measure space in the sense just 
described. 

Considerations such as the above indicate that essentially all of the 
material of [1], appropriately modified, can be comfortably formalized in 2. 

Much effort has been expanded in developing formal systems to accom- 
modate Brouwer’s theory of free choice sequences and related constructs. 
{ am of the opinion that this is not the appropriate approach to Brouwer’s 
ideas. Presuming we are satisfied with 2 as a vehicle for predictive mathe- 
matics, I think we should realize the non-predictive portions of intuitionistic 
mathematics as part of the metatheory of Z, rather than trying to incorporate 
them in some modification of 2. (Aspects of Brouwer’s approach to free 
choice sequences lead me to think he might have been sympathetic to this 
point of view.) Now the metatheory of 2 is based on the work of Gédel [2], 
who shows that every function proved in 2 to exist can be constructed 
(simply by unwinding the existence proof) by means of certain canonical 
operations. Kreisel [5] has shown that every function /, of the type mapping 
sequences of integers into integers, constructed by means of Gédel’s 
canonical operations, is continuous, in the sense that if wg 1s any sequence 
of positive integers there exists a positive integer N such that if w and a’ 
are any sequences of integers with |w| < w 9 and w(n) = w’(n) for all 
n< N then f(@) = f(a’). This can be regarded as the central result of 
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Brouwer’s theory of spreads. Thus we can develop Brouwer’s ideas as a 
metatheory. In particular, Brouwer’s result that every /: R > Ris continuous 
emerges as a meta-theorem, which states that if we construct within 2 a 
function f: R > R having a certain property, then we can construct a con- 
tinuous f having the same property. In fact more is true, the canonical 
function which is constructed by unwinding the formalized proof of the 
existence of fis continuous. After these results have been proved as meta- 
theorems, it becomes constructively meaningful to build a new formal 
system to incorporate the metatheory. This is not attractive at present, since 
the theory of spreads has found no significant mathematical applications. 

In [1] I remarked that the constructive theory of Banach algebras, given 
in chapter 11, was forced and unnatural, and that some metatheory was 
indicated, to smooth the transition from the constructive to the classical 
proofs. Although it is too early to speak with assurance, it appears that for 
the particular application I had in mind the metatheory in question is 
provided by the numerical (or Gédel) interpretation of implication. The 
principle result of chapter 11 of [1] involves the notion of a partial ideal P 
of a Banach algebra 2, determined by elements x,,...,x, of % and a 
totally bounded subset A of 2, and defined as 


P= P(x, ee) 
= {x,yit...+x,),: EA for 1 < i < n}. 


If x,,..., x, generate an ideal J of 1, then the distance of J to the identity 
eis 1, so that for each A the distance of P to e is at most 1. Thus the statement 
S(x1,.-- X,), that x;,-. - x, generate anaidealweant be written 


VAWa dist (PG ees ec )ewor 


where A ranges over the totally bounded subsets of 2{ and @ over the positive 
constants <1. Now we want to constructivize the result I, that if 


X,,...X, generate an ideal J, then for each x in A there exists a complex 
number z such that x,,...,x,,x—ze generate an ideal; in other words, 
the statement S(x,,...,x,) 7 4zS(x,,...,%,,x—ze). A simple and 


trivial modification of the classical proof of T gives a constructive proof 
of the weaker statement I’ 

S(X1 5 ++ -5 %,) > VBVB2z dist (P(xj,.-:, x.) x— ze; Bye) 9, 
where B ranges over the same set as A and f the same set as «. The numerical 
meaning of I” is 

VBVBiz5Al¢’Q, 
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where Q is the statement 
Mist (P(x, cn A) ec eacacs dist (Pix, , es Xe ze; B),2) =p. 


This is just the constructive substitute for [ which was proved in [1] by 
extremely tedious considerations. 

Certain other results in the theory of Banach algebras, not treated in [1], 
cannot be constructivized so simply. For example, an unpublished result 
Q of the author, first published in [7], states that if y is a differentiable arc 
in complex n-space C", then every continuous function f: y + C can be 
uniformly approximated on y by polynomials in the coordinates z,,..., Z, 
of C”. Stolzenberg informs me he has constructivized the classical proof, 
in case y is analytic, by no means a simple task. To try to get a cheap con- 
structivization, note that for each k the distance d,(f, y) of f to the poly- 
nomials of degree k, with coefficients bounded in absolute value by k, is 
computable. (The proof is left to the reader.) Thus our statement Q has 
the form 


VE VyWmakaj(j > madj(f, y) < m7’), 


where f ranges over the set of functions from y to C, y over the differ- 
entiable arcs in C”, and dj(/, y) is the jth rational approximation to the real 
number d,(f, y). Actually this is an abbreviation of 


Vf vyvm(Vy ACF, y, y) > Ska > madi(f, y) < m~*)), 
or 


Vf VyVmakajay(ACf, », y) > (i > madi(f, y) < m™")), 


where now the variables f, y, m, k, j and y range over all functions of certain 
types, and Vy A(f, y, y) is the statement that f and y actually belong to the 
above-mentioned sets. Thus our approximation statement Q is an V3i- 
theorem, which means it can be written in the form VuivP(u, v), where P 
is decidable. Now it is an empirical observation that Va-theorems of classical 
mathematics tend to be constructively valid, so we suspect there is a meta- 
theorem to that effect. For instance, we might try to prove the metatheorem 
M that every V3-theorem provable in the system 2’ obtained by adjoining 
the axiom of the excluded middle to 2 is constructively valid. (Presuming 
the classical proof of the above theorem Q can be formalized within 2’, 
such a metatheorem M would imply that Q is constructively valid.) Now 
Spector [6] has proved such a metatheorem M. Unfortunately, his proof 
involves an inductive procedure whose meaning is unclear, and so the 
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question is still open. At least for certain special theories, such a meta- 
theorem could be extremely useful. 

The system 2, with the predictive interpretation of implication, can 
certainly be presented as a programming language like fortran and algol. 
As stated before, each theorem T of 2 has the form 4xVy A(x, y), where x 
is constructed in the proof of 2. We should be able to write a compiler for 
the language 2, so that whenever a proof of such a theorem 7 is read into 
our computer, the computer will compile a program to compute the con- 
structed quantity x. What do we mean by a program to compute a given 
function x of a given type? Without loss of generality, we consider only 
functions x whose values are finite sequences of integers of a given length 
n. Our question can be answered by induction. Presumably we know what it 
means to program the computer to compute a given integer. If x is an 
n-tuple (x,,..., x,) of functions, to program the computer to compute x 
we program it to compute each of the functions x,,..., x,. Finally, if x is 
of type t > [0], a program to compute x is a program to compute x(u) 
for an arbitrary argument u of x. Since x(u) is determined by the values of u, 
the only information the program will need about wu is the values of u at 
certain of its arguments y,,..., 7,, where of course y; may be a function 
of u(y,),...,u(y;-1), and & itself may be determined in the course of the 
computations. Thus the program for computing x will contain as sub- 
programs the programs for computing y,,..., y,. The computation of 
u(x) will request the values u(y,),...,u(7,) at certain junctures, which 
will be supplied by the program for computing u we supply the machine 
when we request x(w) in conjunction with the programs for computing the 
y; (which are part of the program for computing x). Thus by induction we 
see what it means to program the computation of a given function x. 
Of course, the program may call on other programs, or subroutines, rep- 
resenting proofs of theorems referred to in the proof of the given theorem. 
Definitions may be called as well. Types of functionals will presumably be 
established by type declarations, as in algol. 

As an example, consider the theorem of Koksma [4] that the set A of all 
6 > 1, for which the powers {6"}°., are equidistributed modulo 1, is a full 
subset of the set of real numbers > 1, which means its complement has 
measure 0. Jonathan Tennenbaum has asked whether it is possible to ex- 
plicitly exhibit an element 6 of A, in the sense we can compute an arbitrary 
term of the decimal expansion of 9. The answer hinges on whether Koksma’s 
proof is constructive. It is, except at one point, and that point is easily 
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constructivized. The upshot is that the answer to Tennenbaum’s question 
is ‘yes’. Thus, having realized Y as a programming language, we could feed 
the formalized constructivized version of Koksma’s theorem into the com- 
puter, then feed in a positive integer n, and request the machine to output 
the mth term of the decimal expansion of the fixed @ constructed in the 
proof. We might have to wait a long time. 

It would be interesting to take Y as the point of departure for a reasonable 
programming language, and to write a compiler. 

ADDED IN Proor. The author wishes to thank G. Kreisel, J. Myhill, and 
G. Stolzenberg for correcting inaccuracies in the original draft. Myhill 
notes that in view of such definitions as that of (f, u) € C, it would be more 
comfortable to extend Y by adding as axioms the formulas which specify 
the numerical interpretation of implication. On further research, I have 
not been able to find in [5] or elsewhere in Kreisel’s work the quoted result, 
that every ( mapping sequence of integers into integers is continuous. 
Kreisel proves a somewhat weaker result, a form of pointwise continuity. 
However, the quoted result is valid. The numerical interpretation of state- 
ment (2), as given above, is valid only when each of the variables x, y, u, v 
ranges over a basic set — a set for which no computations are necessary to 
check that an element belongs to the set. In 2, all variables range over basic 
sets; in informal mathematics, all statements can presumably be phrased 
in terms of variables ranging over basic sets. 
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ON THE NOTION OF RANDOMNESS 


PER MARTIN-LOF 


This is a contribution to the investigations into the notion of randomness 
by von Mises, Wald and Church. 
For the sake of simplicity, we shall only consider infinite sequences 


Re 


of the binary digits 0 and 1 obtained by tossing an ideal coin. Adapted to 
this case, Church’s formulation [1] of von Mises’ definition of randomness 
may be phrased as follows. 

Firstly, it is required that the limit frequency should equal 4, 


where 5, = X,+X,+...+x,. Secondly, this is required not only for the 
original sequence but also for every infinite subsequence 


aS. 


obtained by taking a recursive function f, which is defined for all finite binary 
sequences and takes the values 0 and 1, and selecting one after another 
those indicesya for which f(x, x; ...X,-1) = 1. 

The sequences satisfying this definition of randomness form a set of prob- 
ability one with respect to the measure p which makes all coordinates in- 
dependently take the values 0 and 1 with probability 4. A recursive sequence 
is necessarily non random. 

A serious defect of this definition was revealed by Ville [5]. He showed 
that there exist sequences which satisfy the definition but nevertheless have 
the property 
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the probability of which clearly equals zero. Thus von Mises’ definition 
appears to a great extent arbitrary. Not even such an intuitively appealing 
property as the oscillative behaviour of the relative frequencies necessarily 
holds for sequences which are random in his sense. 

The practice in probability theory is rather the following. As soon as we 
have proved that a certain property, such as the law of large numbers, 


wis 1 
lim + = -, 
n>o ff 2 


or the law of the iterated logarithm, 


lim sup ee in 2 f= she ; 
n> co Jn log log n 2 
has probability one, we say that this is a property of randomness. However, 
if we try, within the classical mathematical framework, to define a sequence 
to be random if it possesses all properties of randomness, we are led to a 
vacuous notion. For all complements of one point sets have probability one, 
and hence the intersection of all sets of probability one is empty. Thus no 
sequence would be random. 

It is proposed to avoid this paradox, born of the classical conception of 
the totality of all sets of probability one, by restricting our attention to hy- 
perarithmetical sets or, equivalently, to properties expressible in the construc- 
tive infinitary propositional calculus. This may be regarded as a construc- 
tive version of the restriction to Borel sets which is usually accepted in pro- 
bability theory. Actually, the specific Borel sets considered there are always 
obtained by applying the Borelian operations to recursive sequences of 
previously defined sets, which means precisely that they are hyperarithme- 
tical, 

Our main purpose is to prove the following theorem: 


THEOREM 1. The intersection of all hyperarithmetical sets of measure one is 
a &} set of measure one. 

Proor. If one allows non-constructive methods, it is immediately clear 
that the intersection of all hyperarithmetical sets of measure one also has 
measure one, for there are not more than countably many such sets. It would 
be desirable to have a constructive proof of this, but that has to await the 
constructivization of measure theory applied to 2} and IT} sets in general. We 
content ourselves by noting that the statement to be proved, namely that 
the measure of a certain Z} set equals one, has the quantifier form 2}. 
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Let namely the £} set A be represented by a recursive monotone Suslin 
scheme, 


A = U 0) Ann...m> 


where the union extends over all choice sequences of natural numbers 
myn,...n,... and the sets A,,,,...,, are decidable. Then the statement 


u(A) = 1 may be written in the form 


Rt 


Wien NGO Yer Gay... nm) 1-27), 


my=1 m=1 
which is clearly }. 
It remains to show that the intersection of all hyperarithmetical sets of 
measure one is 2}. This will be done by representing it in the form 


RVe(P(e) A Q(e) > R(e, x)). 


Here Pisa IT} predicate expressing that e is the Gédel number of a hyperarith- 
metical set. Q is a IT} predicate such that, if e is the Godel number of a hy- 
perarithmetical set, then Q(e) expresses that the measure of this set equals 
one. Finally, R is a X; predicate such that, if e is the Godel number of a hy- 
perarithmetical set, then R(e, x) expresses that x belongs to this set. 

To construct the predicate P it will be convenient to view a hyperarith- 
metical set as a countable wellfounded tree to each top point of which there 
is associated an atomic set, that is, a set determined by a condition of the 
form x, = 0 or x, = I, and to each branching point of which there is asso- 
ciated one of the signs |) and (). More precisely, we define a branch to be 
a syntactic expression of the form Qon,Q,n2...Q)-17,Q1, where for every 
P12; ad, nis a mamraliaumber and Q;—, is) or ()jand, finally, 
Q, is an atomic set. A hyperarithmetical set may then be defined as a re- 
cursively enumerable set of branches which can be obtained by repeated 
applications of the following two inductive clauses. Firstly, a recursively 
enumerable set, whose only element is an atomic set, is hyperarithmetical. 
Secondly, if A,, A,,...,4,,... is a recursive sequence of hyperarithmet- 
ical sets, then the recursively enumerable set, consisting of all ()nQon,... 
Q;_,7,Q, such that Qon, ... Q;~,1,Q, belongs to A,, is hyperarithmetical. 
Similarly with () replaced by (). 

A hyperarithmetical set satisfies the consistency condition that, if two 
Bramches Porm, Pym... . PyoynpPpand Qon,Q 72... Q,;-17;Q, belong to 
iedad enn; fori=1,...j, then P; = O; fori = OF1,.. ., 7. In particu- 
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lar, if j = min (k, /) then k = 1. Now, it is clear that we can construct a 
recursive enumeration of all recursively enumerable sets of branches that 
satisfy this consistency condition. Let P(e) be the proposition that for all 
choice sequences of natural numbers n,n,... there exists a branch 
Qon,Q1nz...Q)~1,Q, which belongs to the set whose Godel number in 
this enumeration equals e. Then P is a IT; predicate and, by the bar theorem, 
P(e) holds if and only if e is the Gédel number of a hyperarithmetical set. 

As for the predicate Q it suffices to remark that the measure of a hyper- 
arithmetical set is a hyperarithmetical number, and, consequently,the state- 
ment that a hyperarithmetical set has measure one is hyperarithmetical. 
Also, this is so uniformly in the set considered. A predicate Q with the proper- 
ty that, if P(e), then Q(e) expresses that the hyperarithmetical set with 
Gédel number e has measure one, may thus be chosen JJ} or Lj just as we 
please. (Of course, this does not imply that there is a hyperarithmetical Q 
with the desired property.) We need Q in IT} form. 

A similar argument shows that a predicate R with the property that, if 
P(e), then R(e, x) expresses that x belongs to the hyperarithmetical set whose 
Gédel number equals e, may be taken either IT} or 2}. We need it only in . 
Z} form. The proof is now complete. 

It is proposed to call the X} set just constructed the set of random sequences. 
More precisely, it is the set of sequences that possess all hyperarithmetical 
properties of randomness. 


THEOREM 2. A hyperarithmetical sequence is not random, 

Proor. If a sequence is hyperarithmetical, then the one point set deter- 
mined by that sequence is a hyperarithmetical set of probability zero and 
hence disjoint from the set of random sequences. 


THEOREM 3. The set of random sequences is not hyperarithmetical. 

ProoF. Suppose that it were hyperarithmetical. Being a hyperarithmetical 
set of positive measure (in fact, measure one) we could then find a hyper- 
arithmetical point in it which contradicts theorem 2. The fact that a hyper- 
arithmetical set and, more generally, a JT; set of positive measure contains 
a hyperarithmetical point is proved in Sacks [4]. 

Actually, the set of random sequences is not even IT}, because being 2}, 
it would then by Suslin’s theorem be hyperarithmetical in contradiction to 
theorem 3. 

It is a simple corollary of theorems 1 and 2 that there exists a /7? set in the 
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Baire space N™ which is non-empty although it contains no hyperarithmeti- 
cal points. To see this it suffices to represent the set of random sequences, 
which is a 2} set in the Cantor space 2”, as the projection of a I7? set in 
2% x N%. Since no hyperarithmetical sequence is random, this [T° set contains 
no hyperarithmetical points. Now, the set remains J7° if we consider it as a 
subset of N* x N* instead, and the latter space may be identified with N” in 
the usual way. The first example of a set of this kind, or, equivalently, of a 
recursive tree which is not wellfounded although all its hyperarithmetical 
branches are finite, was given by Kleene [2]. 

Since the set of sequences that satisfy Church’s form of von Mises’ 
definition of randomness is a hyperarithmetical (even arithmetical) set 
of measure one, it is strictly more inclusive that the set of random sequences 
defined here. By the same argument it is seen that the present definition is 
stronger than the one proposed in Martin-Lof [3], where the elements of a 
certain 23 set of probability one were called random sequences. 

Already Wald [6] proposed to sharpen von Mises’ concept of random- 
ness by defining a sequence to be random if it possesses all properties of prob- 
ability one which are expressible within a certain formalized logic such as 
Principia Mathematica. However, just as the set of random sequences de- 
fined here has turned out not to be hyperarithmetical, we must expect that 
Wald’s proposal leads to a set which is no longer expressible in the language 
we started with. And, if this language is Principia Mathematica or set theory 
as formalized by Zermelo and Fraenkel, it seems doubtful if we can ever 
get a clear conception of such a set of random sequences. 

The main improvement of the present paper as compared with Wald 
[6] is, firstly, the choice of a language, namely the constructive infinitary 
propositional calculus, which seems particularly well adapted for probability 
theory, and, secondly, the fact that we have been able to prove that the set of 
random sequences, although escaping the hyperarithmetical hierarchy, does 
not escape us completely but belongs to a class of sets, namely the 2{ sets, 
which can still be handled constructively, although, naturally enough, this 
requires abstractions more powerful than those needed on the hyperarithme- 
tical level. 
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ABSTRACT QUANTIFICATION THEORY* 


RAYMOND M. SMULLYAN 


Introduction. An alternative title for this subject might be ‘Quantification 
theory without logical connectives or quantifiers.” The point is that many of 
the well-known theorems about first order logic can be established as con- 
sequences of purely combinatorial lemmas which do not depend in any 
essential way on the logical connectives or quantifiers. In earlier works 
({4], [5], [6], [7]) we introduced a uniform ‘e, f, y, 5’ categorization for 
quantification theory such that in both the presentation of the postulates 
and the treatment of the metatheory the logical connectives and quantifiers 
never appear explicitly. The starting point for this present investigation is the 
realization that the mathematically interesting results of first order logic 
can be derived from certain very general properties of the «’s, f’s, y’s and 6’s 
without having to go back to their definition. Our central definition is that 
of a logical framework in which we abstract these key properties. The logical 
frameworks which arise in the concrete contexts of propositional logic and 
first order logic possess a certain property which we do not postulate in our 
general definition, viz. that no formula possesses an infinite descending 
chain of proper subformulas. The interesting thing is that this property 
(which we refer to as regularity or well-foundedness) is exploited in some com- 
pleteness proofs but not it others. As a result, different proofs of the same 
completeness theorem generalize in our abstract setting to distinct complete- 
ness theorems. 

This paper is but a brief sketch in which we give the main definitions and 
statements of theorems, but not the proofs. The proofs should be easy to 
supply by those readers with a background in quantification theory, par- 
ticularly by those familiar with the tableau point of view. A complete ex- 
position of this subject is currently in preparation. 


* This research was sponsored by the Information Research Division of the Air Force 
Office of Scientific Research under grant number 43367. 
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1. Logical frameworks 


DEFINITION 1. By a (classical) logical framework £ we shall mean an order- 
ed sextuple <E, C, D, g, ~, P> of the following items obeying the condi- 
tions stated alongside: 

(1). E£ is a well-ordered set whose elements we call the elements of the 
framework. 

(2). C is a subset of E whose elements we call conjunctive elements. 

(3). D is a subset of E whose elements we call disjunctive elements. 
Elements in CUD will be called compound; elements of E outside CUD 
will be called atomic. 

(4). @ is a function which assigns to every compound element x a finite 
or denumerable sequence ¢(x,,X2,...,X,,--->» Of elements of E whose 
terms we call the components of x, more specifically, we refer to x; as the 
ith component of x. We require that if x be both conjunctive and disjunc- 
tive, then x has only one component. 

(5). ~ is a function which assigns to each x € E an element X in Ecalled 
the conjugate of x. We require that conjugation obeys the following laws 
(for all x in E): 

(Gy SE S2 se 

(ii). the conjugate of X is x; 

(iii). if x is conjunctive, X is disjunctive, and if x is disjunctive, then X is 
conjunctive; (From this it follows that if x is atomic, so is X.) 

(iv). @(%) has the same number of terms as g(x), and the ith term of 
(xX) is the conjugate of the ith term of p(x). (Thus if p(x) = <x,, X2,..., 
Pe Mihero(x)= aay. ee) 

(6). P is a function which assigns to each x a finite set P(x) of positive 
integers. We say that x depends on i if ie P(x), and otherwise that x is 
independent of i. 

This concludes our definition of a logical framework. 


We use the symbol « to mean any conjunctive element x such that p(x) is 
finite, 8 for disjunctive x such that (x) is finite, y for any conjunctive x such 
that ~(x) is infinite, 6 for any disjunctive x such that g(x) is infinite. We 
write a; for the ith component of « and similarly with 8. We write y(i) for 
the ith component of y, and similarly with 6. Our conjugation laws imply 
that % is some f, and that for any i < the number of terms of g(«), (x); = 
= a;. Likewise for any y, is some 6, and 7(i) = ¥(i)- 

Applications. Take classical first order logic formulated with ~, A, v, 
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>, V, J as independent primitives, and formulated with individual variables 
(no, .2) 8, 2. and individual parameters (constants)@,,05,. 15,04). .. 
All formulas will be assumed closed. Now we introduce two new symbols 
T and F, and we define a signed formula as an expression of the form TY or 
of the form FX, where X is a formula. In the framework we now construct £ 
is to be the set of signed formulas. Our «’s shall be all signed formulas of one 
of the four forms T XA Y, FXv Y, FX > Y, F ~ X; our f’s shall be all 
elements of one of the forms T XV Y, FXAY, TX > Y, T~ X. Our y’s 
shall be of one of the forms T (Vv) (v), F (Av)¥(v) (where vis any individual 
variable), and our 6’s shall be elements of one of the forms T (4v)¥(v), 
F (Vv) ¥(v). Our function @ shall assign to each w a 2-term sequence <a, 
a2», to each f a 2-term sequence ¢f,, 82>, to each y a denumerable sequence 
<y(1), y(2), ..-, p(2),...>, and to each 6 a denumerable sequence <6(1), 
5(2), ..., (a), ...>, where the components «;, B;, y(i), 5(i) are defined by 
the following tables: 


% | Oy | %2 B | By | B2 
Ty we rxe| Ty. TX Gy MW rem TY 
Pay YO oF Xme| Pee FX, Var Yee) FY 
PX ene T Xora T XS Ve EXAM) TY 
Fr X ie seb T XE X 4) FX 

y |r) 5 | 5(i) 
T (Wo) ¥(v) | T¥(a,) T (Av)¥(v) | Tw(a,) 
F (Av) ¥(v) | FY (a;) F(Wv)¥(v) | FY (a;) 


We next define the conjugate of a signed formula as the result of changing 
T to F or F to T (thus TX = FX; FX = TX). The reader can easily verify 
that all our conjugation laws of (5) hold. Finally, for any signed formula X we 
define P(X) to be the set of all positive integers i such that the parameter a; oc- 
curs in X. 

We have thus described the framework that arises with classical first 
order logic. We remark that although we used ~, A, v, >, V,4 as inde- 
pendent primitives, we could just as well have taken any complete subset. 
One advantage of our uniform treatment is that it does not commit us to 
any particular choice of primitives. We might also see that our scheme could 
accommodate joint denial, or Sheffer’s stroke function, or converse implica- 
tion as primitives (but not e.g. the bi-conditional). 
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Frameworks are also applicable to languages with expressions of denu- 
merable length. For languages with expressions of non-denumerable length, 
we would have to extend the notion of a framework by allowing ¢(x) to be 
a possibly non-denumerable sequence. These extended frameworks are stu- 
died in Linden [3]. 


2. Regularity and induction. We now return to the purely abstract develop- 
ment of the subject of frameworks. In this section and the next the function 
P will play no réle. 

By a descending chain for x we mean a finite or denumerable sequence 
whose first term is x and which is such that every other term is a component 
of the preceding term. We call y a descendant of x if y is a term of some des- 
cending chain for x. (Our notion of descendant plays the réle of the usual 
notion of subformula, or rather of positive subformula in the sense used by 
Roger Lyndon.) 

We call x a regular element if it has no infinite descending chain. We call 
x uniformly regular if there is a finite upper bound for the lengths of all 
chains for x. We define the degree of a uniformly regular element x as the 
smallest k greater than the lengths of all chains for x. We call £ well-founded 
or regular if all elements of # are regular, and uniformly regular if all 
elements of # are uniformly regular. 

The framework of ordinary first order logic is regular, in fact uniformly so. 
The framework of propositional logic with denumerably infinite conjunc- 
tions and disjunctions is regular but not uniformly regular. 

We define Y to be a Boolean framework if for each x, p(x) is a finite 
sequence (in other words there are no y’s or 6’s). We remark that regular 
Boolean frameworks are automatically uniformly regular (this is a conse- 
quence of K6nig’s infinity lemma). 

Call a property (of elements of £) inductive if it holds for all atomic ele- 
ments, and for any compound element x, if it holds for all components of x, 
then it also holds for x. The following induction principle is basic: 

If a property is inductive, then it holds for all regular elements. 


3. Semantic notions. We let S be a subset of E. 


DEFINITION 2. We say S is closed downwards if for every conjunctive ele- 
ment c of S, all components of c are in S, and for any disjunctive element d 
of S, at least one component of dis in S. We say S is closed upwards if the 
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converses hold, i.e. if for every conjunctive c, c € S providing all components 
of c are in S, and for any disjunctive d, de S providing at least one compo- 
nent of dis in S. 


DEFINITION 3. We call S a Hintikka set if S is closed downwards and for 
no atomic element p it is the case that p and p are both in S. We call Sa 
Kalmar set if S'is closed upwards, and for every atomic p at least one of p and 
pisin S. 
DEFINITION 4. We call S a quasi-truth set if S is both a Hintikkaset and a 
Kalmar set. Equivalently, S is a quasi-truth set if S is closed both upwards 
and downwards, and if for every atomic element p exactly one of p and pis 
in S. 

We call S a truth set if S is closed both upwards and downwards, and if 
for every element x exactly one of the pair x, X lies in S. 


DEFINITION 5. We call x valid (in the framework .) if x belongs to all truth 
sets, and satisfiable if x belongs to at least one. We call S (simultaneously) 
satisfiable if S is a subset of at least one truth set. (We note that x is valid 
iff X is not satisfiable. ) 

We call S quasi-satisfiable if S is a subset of some quasi-truth set. We call 
x strictly valid if X is not quasi-satisfiable. 


DEFINITION 6. By a basic set B we mean a set of atomic elements such that 
for any atomic element p exactly one of the pair p, p lies in B. 


By induction one easily proves 


THEOREM |. Suppose M is closed downwards and U is closed upwards and 
every atomic element of M lies in U. Then every regular element of M lies in U. 


CoROLLaRY |. In a regular framework, if two truth sets contain the same ato- 
mic elements, then they are identical. Hence also, in a regular framework 
a basic set can be extended to at most one truth set. 


Duality. Let # be a framework <E, C, D, 9, ~, P>. By the conjugate frame- 
work Y we shall mean <E, D, C, @, , P>. (Thus conjunctive elements of 
F are called disjunctive in Y and vice-versa). It is trivial to verify that L 
is actually a framework. 

The following theorem, though trivial, is useful in some subsequent proofs, 
since it cuts the labor in half. 
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THEOREM 2. (DUALITY THEOREM). (a). S is closed downwards in £ iff 
E-—S is closed upwards in L; 

(b). Sis a Hintikka set in L iff E—S is a Kalmar set in £; 

(c). Sis a quasi-truth set in L iff E—S is a quasi-truth set in &. 
Likewise with truth set. 


We next need 


THEOREM 3. (a). Jf S is a Hintikka set, then for no regular element x can 
x and X both be in S. 

(b). If Sis a Kalmar set, then for every regular x, at least one of x and X 
lies in S. 

(c). If S is a quasi-truth set, then for every regular x, exactly one of x and 
x lies in S. 

One proves (a) by induction. Statement (b) can be proved by a dual ar- 
gument, or better yet, directly from (a) using the duality theorem. 


THEOREM 4. (a). Every maximal Hintikka set is a quasi-truth set. 
(b) Every minimal Kalmar set is a quasi-truth set. 
Again, one can prove either half by induction and obtain the other half © 
by duality. 


Now consider a basic set B. Let # be the union of all Hintikka supersets of 
B, and let # be the intersection of all Kalmar supersets of B. It is easily 
verified that #, is a maximal Hintikka set and that.#, is a minimal Kal- 
mar set. Thus theorem 4 yields 


THEOREM 5.4%, and # , are both quasi-truth sets. 


It is trivial to extend any Hintikka set H to a Hintikka set H’ which con- 
tains a basic subset B (just take all conjugate pairs (p, p) such that neither p 
nor p occurs in H and add one of them to H). Also H’ © #,, and #, is 
a quasi-truth set. 

Hence we have: 


THEOREM 6. (a). Every Hintikka set is quasi-satisfiable. 
(b). (After Hintikka). In a regular framework, every Hintikka set is 
satisfiable. 


Combining earlier theorems, we have 


THEOREM 7. (FUNDAMENTAL VALUATION LEMMA). Any basic set B of a 
regular framework £ can be extended to one and only one truth set S. And for 
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any Hintikka superset H of B and any Kalmar superset K of B we have 
MSS S IK 


Discussion. The inclusions H € S, S ¢ K of theorem 7 are quite important. 
The first one is substantially Hintikka’s lemma and underlies completeness 
proofs of so-called ‘cut-free’ systems. The second half (which is ‘dual’ to 
the first half) underlies Kalmar’s completeness proof for propositional logic. 
Indeed it yields 


CoroLiary 2 (After Kalmar). Let # be a regular Boolean framework 
and let | be a relation between finite subsets of E and elements of E satisfying 
the following conditions (for all S, x, y, x, B): 

Ky. For every xéES, Stx. 

K, Jf for each component a; ofa St a;, then St «. 

K,. Jf for at least one component B; of B S+ B;, then St B. 

K3. If Su{x}t y and Su {X}ry, then Sty. 

Then for any valid element x of B +x (where @ is the empty set). 


We remark that the conclusion still holds if we weaken the hypotheses 
by requiring them to hold only when S contains only atomic elements, and 
by requiring K to hold only when x is an atomic descendent of y. This then 
in an obvious manner provides a complete Gentzen type axiomatization 
of regular Boolean frameworks (if we replace ‘+’ by the Gentzen arrow and 
look at Ky as axioms and K,, K,, K; as inference rules. Rule K, is then a 
sort of ‘cut’ rule, but it is analytic (in the sense of [7]), i.e. proofs obey the 
subformula principle. 


4. Tableaux. By a tree 7 we shall mean a collection of finite or denumerable 
sequences (of elements of £) all having the same first term (called the origin 
of 7) and such that no sequence is a proper initial segment of any other. 
We refer to the elements of 7 as branches. If 0 isa finite branch (x,,..-, X,), 
then by (9, y) we mean the sequence (x,,...,X,,y). To extend a branch 
8 to (0, y), we mean the act of deleting 8 from 7 and putting in its place 
(9, y). To split or simultaneously extend 0 to (0, y;), ..., (8, y,) is to delete 6 
from 7 and put in its place (0, y,),..., (0, ¥,)- 

By an analytic tableau for a set S (of elements of E) we mean a tree con- 
structed as follows: 

We Start the tree by taking any element of S as the origin. Now suppose 
TJ isa tableau for S and @ is any branch. Then we may extend @ by any of 
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the following five operations (and the result is again a tableau for S): 


(A). If « is a term of 0, then we may extend 0 to (6, @;), where «; is any 
component of a. 

(B). For any term f of @ we may simultaneously extend 6 to (0, B;),..., 
(0, B,), where B,,..., 8, are the components of f. 

(C). For any y € 0 we may extend @ to (6, y(#)) (for any 7). 

(D). For any 6 € 6 we may extend @ to (6, d(i)), providing all terms of @ 

and all elements of S are independent of /. 

(E). We may extend 6 to (6, s) for any element s of S. 

A branch of a tableau is called closed if it contains some element and 
its conjugate; otherwise it is called open. A tableau is called closed if all 
its branches are closed. 

By a synthetic tableau for a set S, we mean a tree constructed using the 
rules above as well as the following: 

(F). For any element x of E, we may simultaneously extend 6 to (0, x) 

and (6, x). 
Rule (F) is the counterpart to Gentzen’s cut rule. 

Now we define a set S to be analytically inconsistent if there is a closed 
analytic tableau for S, and synthetically inconsistent if there is a closed syn- 
thetic tableau for S. We define an element x to be analytically (synthetically) 
provable if the unit set {x} is analytically (respectively synthetically) in- 
consistent, and by an analytic (synthetic) proof of x we respectively mean a 
closed analytic (synthetic) tableau for x. 

Semantic normality. We have still made no assumptions about the P func- 
tion. Now it is necessary to do so. We shall call & semantically normal if 
for every set S, every 6, every i and every / such that all elements of SU{6} are 
independent of j, if SU {d(i)} is satisfiable (quasi-satisfiable) then Su {6(j)} 
is satisfiable (quasi-satisfiable respectively). 

We henceforth assume ¥ to be semantically normal. We also henceforth 
let S be any denumerable set (of elements of E) such that there are only 
finitely many 7 such that some element of S' depends on i. 

One easily shows: 


THEOREM 8 (Correctness of the Tableau Method). (a). If S' is quasi-sa- 
tisfiable, S is analytically consistent. If x is analytically provable, x is strictly 
valid. 


(b). If S is satisfiable, S is synthetically consistent. If x is synthetically 
provable, x is valid. 
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By the method of systematic tableaux (cf. e.g. [1]), one proves 
Lemma 1. Jf S is analytically consistent, then S is a subset of a Hintikka set. 
The above lemma with Hintikka’s lemma gives 


THEOREM 9 (ANALYTIC COMPLETENESS THEOREM). If S is analytically consis- 
tent, S is quasi-satisfiable. If x is strictly valid, x is analytically provable. 


Next we define a set M to be E-complete if for every 6 € M, there is some i 
for which 6(i) € M. Then one easily shows 


LEMMA 2. If M is both E-complete and a maximal set which is synthetically 
consistent, then M is a truth set. 


By a Henkin type construction, one can extend a synthetically consistent set 
to a set M satisfying the hypotheses of the above lemma. Hence we have 


THEOREM 10. (SYNTHETIC COMPLETENESS THEOREM). Jf S is synthetically 
consistent, S is satisfiable. Also every valid x is synthetically provable. 


For regular frameworks theorems 9 and 10 say the same thing. For non- 
regular frameworks, they are distinct theorems which as far as we know are 
of incomparable strenths. 

We now know that analytic consistency is equivalent to quasi-satisfiabilty, 
and synthetic consistency is equivalent to satisfiability. Since analytic con- 
sistency and synthetic consistency are both properties of finite character, 
we have 


THEOREM 11 (COMPACTNESS THEOREMS). (a). Jf all finite subsets of S are 
quasi-satisfiable, then S is quasi-satisfiable. 
(b). [fall finite subsets of S are satisfiable, then S is satisfiable. 


Elimina bility. Call an element x eliminable if for every finite set S, if SU{x} 
and SU {x} are both analytically inconsistent, then S is analytically inconsis- 
tent. In a regular framework analytic consistency is equivalent to satisfiabili- 
ty, and so every element is eliminable (Gentzen’s theorem). If # is not ne- 
cessarily regular, the situation is this. Suppose S is analytically consistent. 
Then S is a subset of some quasi-truth set S*, and for any regular x either 
x € S* or Xe S*, hence either SU {x} or Su {x} is quasi-satisfiable, hence 
also analytically consistent. We thus have 


THEOREM 12. For any (semantically normal) framework £#, every regular 
element x is eliminable. 
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Next we observe that the statement ‘every x is eliminable’ is equivalent to 
‘analytic consistency = synthetic consistency’ which is equivalent to ‘quasi- 
satisfiability = satisfiability’ which is equivalent to ‘every Hintikka set is 
satisfiable.’ Thus we have the following theorem (which is reminiscent of a 
result of Schiitte for type theory): 


THEOREM 13. For any (normal framework (£) (regular or not), the following 
two conditions are equivalent: 

(1). Every element is eliminable. 

(2). Every Hintikka set is satisfiable. 


Discussion of further results. For semantically normal regular frameworks, 
virtually all of the results of [5] go through, in particular the completeness 
of all the Gentzen type systems considered there, the first symmetric comple- 
teness theorem, and the first system of linear reasoning. For results like the 
fundamental theorem and Craig’s interpolation lemma we require addi- 
tional properties of our framework to which we now turn. 


5. Structurally complete frameworks. The framework of first order logic pos- 
sesses the following two properties: 

C,. For any x and y there is ana whose set of components is {x, y}. 

C,. For any x and any positive integer i there is some y such that 

y(i) = x and y is independent of i. 
None of our results so far has required these properties! We call & struc- 
turally complete if C, and C, hold. 

Concerning C,, we might impose the stronger requirement Cf that for 
any x and y there is some « such that g(a) is the sequence <x, y>. We remark 
that the framework of first order logic with primitive propositional connecti- 
ves ~, A, V, > does not have this stronger property, but if we added con- 
verse implication as another primitive, the stronger property Cf would hold 
(as the reader can easily verify). 

We now write x Ay to mean any « whose set of components is {x, y}, 
and (Vi)x to mean any y independent of i for which y(i) = x. We write 
xv y to mean any f whose set of components is {x, y}, x > y for any PB 
whose set of components is {X, y}, and (3i)x for any 6 independent of i for 
which 6(i) = x. (Humorously enough, we are restoring the logical connec- 
tives and quantifiers, but only in the metalanguage!) 

Property C, is all we need in order to state and prove the fundamental 
theorem. For Craig’s interpolation lemma we need C, as well, and also the 
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following additional facts about the P function: (1). For any a, « depends on 
i iff some component of « depends on i; (2). For any », i andj ¥ i, y(7) de- 
pends on j iff y depends on j; (3). x depends on i iff x depends on i. For such 
frameworks, the Craig-Lyndon lemma holds, where we now define an 
interpolation element for x > y as an element z such that (i) x > z and 
z > y are both valid; (ii). for any i, z depends on i iff both x and y depends 
on i; (iii) every atomic descendent of z is a descendent of both x and y. 


6. Structures for intuitionistic logic. We have recently been able to extend 
our uniform method and its corresponding abstract viewpoint to in- 
tuitionistic logic. We give here only an extremely brief sketch in which we 
are mainly interested in giving the appropriate definitions. 

By a structure ¥ we shall mean a triple (#, E,, E,>, where # is a frame- 
work satisfying an additional condition to be given later, E, is a subset of 
E whose elements we call 4-special, E, is a subset of E whose elements we 
call permanent. We call an element V-special if its conjugate is 4-special. 
4-special and V-special elements are collectively called special, and elements 
not called special are called ordinary. 

In applications to intuitionistic logic, the 4-special elements are to be sig- 
ned formulas of one of the three forms FX > Y, F ~ X, F(Vx)¥(x). The 
permanent elements are to be those formulas signed with T. 

We refer to ¥ as the underlying framework of /. The additional condition 
we must impose is that for any element Q which is either some y or 6 and any 
distinct i and j, if Q(i) depends on j, then Q depends on j. 

Models. The following notion of ‘model’ appears to be the appropriate ge- 
neralization of Kripke [2]. 

By a model. for SY we shall mean a quadruple ¢<¥Y, R, +, J satisfying 
conditions I and II below, where & is a collection of objects called universes, 
R is a reflexive and transitive relation between universes, | is a relation be- 
tween elements I of Y and elements x of FE, J is a function assigning to each 
universe a non-empty set of positive integers whose members we call the 
integers available in I. We call I’, an extension of I, if [,RI,. We use 
the notation ['* for any extension of [. We read “I+ x’ as ‘T yields x’ or 
‘T forces x’. Now for the conditions. 

I (a). For any I and x, I yields exactly one of x and x. 
(b). If P+ x and x is permanent, then for any ['*, ['* + x. 
(c). Ifiis available in I’, then jis available in I'*. 

II (a). For ordinary «, [+ « iff P yields all components of «. 
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For 3-special «, [| « iff some ’* yields all components of «. 
For V-special a, + « iff every [’* yields all components of a. 
(b). For ordinary y, [+ y iff for every i available in I, [+ (2). 

For 3-special y, + y iff for some I’* and all i available in r'*, '* 
yields (i). 

For V-special y, + y iff for every [* and every i available in I'*, P’* 
yields »/(i). 

Remarks 

(1). The conditions on the f’s and 6’s follow from II and I(a). 

(2). Our notion of ‘model’ derives from Kripke as follows: In Kripke’s 
models, the relation + is between universes and unsigned formulas. Let us 
say that I yields TX if [+ X, and I yields FX if I does not yield X. The 
reader can verify that conditions I and II are then a uniform version of 
Kripke’s conditions. 

(3). Our conditions are really more general than needed for intuitionistic 
logic. The fact is that in the structure of intuitionistic logic there are no 
V-special a@’s nor J-special y’s. 


Realizability. Following Fitting [1] we say that I realizes x if yields x and 
if every i on which x is dependent is available in I. Now x is called valid 
if for every model, each universe of the model realizes it. A set M 1s called re- 
alizable if for at least one model at least one universe of the model realizes 
all elements of M. 


Semantic normality. We call SY semantically normal if for every Q which is 
either some y or 6 and every finite set M (of elements of E), if MU{Q(i)} 
is realizable and if no element of MU{Q} is dependent on j, then MU{Q(j)} 
is realizable. 


We again assume semantic normality. 


Tableaux. In discussing tableaux for a structure Y, we need some device per- 
mitting us at various stages to declare an element on a branch to be cancel- 
led or inoperative on that branch (this can be formalized in many different 
ways). A branch is then understood to be closed if its set of uncancelled 
terms contains some element and its conjugate. 

Our classic tableau rules are then modified as follows. Rules A, B, C, D 
are the same providing the a, 8, y, 6 are either ordinary or V-special. But if 
they are J-special, one cancels all non-permanent elements on the branch 
@ before extending it. 


AV ABSTRACT QUANTIFICATION THEORY 91 


Completeness. Kripke’s form of the completeness theorem goes through in- 
tact for semantically normal regular structures , i.e. every consistent set 
is realizable (and hence every valid element is provable). Bifurcations of the 
completeness theorem for non-regular structures will be the subject of anoth- 
er study. 


Concluding remarks. Many theorems of intuitionistic logic (such as the com- 
pleteness theorem) do not depend in any way on which elements were sin- 
gled out as ‘special’. All theorems about structures (as we have defined them) 
apply as well to other subsystems of classical logic by simply changing the 
class of special elements. 
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SECTION B 


TRADITIONAL INTUITIONISM 


RECENT PROGRESS IN INTUITIONISTIC ANALYSIS* 


A. HEYTING 


In this lecture I wish to report about work done by two of my students, con- 
tinuing partly unpublished work by myself. Part of the results have been pu- 
blished in the thesis of Ashwini Kumar, now in Kanpur, India [1] and in 
that of C. G. Gibson, now in Liverpool [4]. 


1. The Brouwer-Stieltjes integral. Gibson worked on the Brouwer-Stieltjes 
integral. We cannot restrict ourselves to the case that the non-decreasing 
function 2 which defines the measure is everywhere defined, because it 
would then be continuous. It is sufficient to suppose / defined on a dense, dis- 
crete subspecies of the real line. As / is right continuous, its domain can be 
extended by the limitpoints of decreasing sequences <x;>, for which the se- 
quence <Ax;> converges. However, the theory is developed for a 4 with a 
dense, discrete domain; it is a theorem that functions with the same maximal 
extension give rise to equivalent measures. 2 must have the infimum 0 and 
its supremum must be calculable. 

Brouwer’s theory of measure and integration [3] can now be followed; 
instead of rational intervals cells are used; a cell (a, b]is a set {xa < xb}, 
where a and Db are in the domain of 4. It proved advantageous to use here 
the negative order relation: if a < 6 is the positive order relation, which in- 
volves that there are two different rational numbers between a and Bb, then 
a < bmeans ——(a < b). The main advantage of < over < is that by the 
use of < the difference of two cells is either a cell or the sum of two cells. 
It need not be true that every point in the domain of 4 is measurable; this 
is only the case if sup,<,A(y) is calculable. 

The theory can be extended to functions of bounded variation, provided 
we suppose that the variation is calculable. Every function yp of calculable 
variation which is continuous on the right, is the difference of two non- 
decreasing functions. Let v(y, x) denote the variation of w over [y, x], 
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p'(x) = sup, <,V(y, x) = v(— 0, x) and p(x) = w'(x)—u(x), then p' and 
u’’ are nondecreasing and continuous on the right. 

The converse does not hold: if 2 and 4’ are right continuous and non- 
decreasing, then A—A’ need not be of calculable variation. Let A, be the 
function defined by 4,(x) = Ofory < x, A, = 1 fory « x. Then var(A,—A,) 
= 2 for y#z and var(A,—A,) = 0 for y = 2. Thus if A,—A, is of calculable 
variation for all values of y and z, then it is decidable for all y and z whether 
y = zor y#z. In particular the function f defined by f(y) = 0 for y # 0, 
JF (0) = 1 would be everywhere defined and consequently continuous. This 
is contradictory. The result can be expressed by saying that the space of 
functions of calculable variation is not linear. It is a subset of the linear space 
of functions of bounded variation. 


2. Separable Hilbert space. Other work concerns the theory of separable 
Hilbert space ([1], [6], [7], [8]). It can be defined as usual by the set of se- 
quences <x;> such that )° x? converges; only the convergence must be taken 
positively. The notion of a sequence can here be taken in the sense of an 
ips. Von Neumann’s axioms can be taken over, but must be formulated more 
precisely [7]. The functions f such that f? is Brouwer summable form a 
Hilbert space B?; there is no doubt that this result can be extended to Brou- 
wer-—Stieltjes summability for a given measure A, though this has not been 
proved explicitly. 

In the theory of linear functionals difficulties arise with respect to the 
Riesz representation theorem. \/ A(F(/) = (f, A)) if and only if conditions 
(a) and (b) are satisfied: 

(a). F has a calculable supremum ||F'|| for unit vectors, 

(b). there is a vector fo of minimal length such that |F(fo)| = ||FIl?- 
If ||F|| #0, then there is a unit vector f; such that F(/,) = ||F Il, and |IF IA 
= fo satisfies condition (b). 

The separable Hilbert space H cannot be represented by a fan [2] but it ad- 
mits a representation by a spread. Suppose that H is given by sequences 
<x;> such that )? x? converges. Let G be the subset of H, consisting of finite 
sequences of rational complex numbers. G is dense in H. I construct a spread 
S as follows: Every sequence of natural numbers is an admissible sequence 
for S. Let g,, g2,... be an enumeration of G. The complementary law I of 
Sis defined as follows: P(<k,>) = g%,. F(<k1,~. +s Ka-15 Ky>) = 91, Where 
l is the k,-th number such that 
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Every element 9m, Gm,,--- Of S determines a point A of H such that 
|h—9m,| < 2°" for every r. Conversely every point in H is the limit of an 
element of S. 

The spread representation is used in the proof of the following theorem [1]. 


THEOREM. A linear operator T which is defined all over H, is uniformly con- 
tinuous on H. 

Proor. For hé€ H we can find a sequence q = q,, q2,... in S such that 
(g:—A| < 27'~? for every i. Then, if |a—h’| < 27"~1, we have |q,—h'| < 27! 
for 7 < n, so there is a sequence q’ in S starting with q,,..., 9, and coinci- 
ding with A’. Letr = r,,1r,,... be a sequence in S that coincides with Th. 
For a given m, r,, must become known after a finite segment of g, say g,,..., 
9, has been chosen. It follows that r,, is the same for Th’ as for Th. We have 
then 

[h'—h| < 27"! > |Th'-Th| < 27™*?. 


This is the case for every full operator T. If T is linear, we have 
| = eee [Tf <2e"t 
for every f, so 
(h’'—hl < 27°"! > (Th'—TA| < 27-"*! 
for every / and /’. 


A closed linear manifold M in H need not have a calculable dimension. If 
a finite or denumerably infinite orthonormal basis for M is known, then M 
is a subspace of H. The orthogonal complement of a linear manifold M 
is a closed linear manifold M~. Even if M is a subspace, it is not always 
possible to write every vector fin the formf = f; +/,, where f, ¢ M,f, € M~. 
A necessary and sufficient condition for the general possibility of this de- 
composition is that M is located; this means that the distance of every 
vector from M is calculable. It is sufficient to know the distance from M of 
every vector in an orthonormal basis e,, e,,... of H. Let b,, b,,...be 
an orthonormal basis for M@, where b; = )), b,,¢,. Then an equivalent con- 
dition is that )’,bj, is calculable for every k. 

M < M*++, but M** = M cannot be asserted, even if M is a closed sub- 
space. 

Example: M is defined by the basis 6,,5,..., where b; = e,, except when 
the i'"-(i+9)" digits in the decimal expansion of x are all 7; then b; = e,. 
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e, ¢ M~ © there are no 10 consecutive 7’s in 7 

e, ¢ M** «it is impossible that there are no 10 consecutive 7’s in n. 
Always M¢*t++ = M7". 

The spectrum of a full completely continuous self-adjoint operator, for 
which the quadratic functional (Af, /) is positive definite, can be determined 
by the method of Hilbert-Courant, which has the advantage that the charac- 
teristic values are calculated one after the other. Let F be any full completely 
continuous functional, then mo = sup);;=,F(/) is calculable. Also, if M 
is a finite-dimensional subspace, then 


m(M) = sup F(f) 
femM+ 
1fj=1 


is calculable, and so is m, = inf m(M) for all k-dimensional subspaces M. 
The sequence mp, m,,... is non-increasing and has the limit 0. If F(f) = 
= (Af, f), then my, m,,... are the characteristic values of A. All this can 
be proved intuitionistically, but the proofs are more complicated than the 
classical ones; they use several times the fact that a function defined on a 
bounded closed located species has a calculable supremum, [8]. 

The characteristic vectors of A need not be calculable, even in a finite- 
dimensional space. 

Example: 


F(x, y) = x? +y? +(ax+by)’; My = 1+a7+b?, m, =1; 
to = (—6, a), ip = (a, b). 


If no positive lower bound is known for fa], nor for |bj, we shall not always 
be able to calculate the direction of fo. However (and this is also the case in 
Hilbert space) if my # m,, then fj can be found such that Ff) = my. 

Ashwini Kumar [1] has considered full self-adjoint operators, not ne- 
cessarily completely continuous. His definitions of the resolvent species 
RS(7), the approximate point spectrum APS(T) and the continuous spec- 
trum CS(T) are positive versions of the classical definitions. For instance, 
Ae CS(T) if 


S#g > (T-A)f# (T-A) 9 


and for every k there is a unit vector A such that |(T—2)'h| > 2*. 

RS(T) - APS(T) = 9, but as might be expected, it cannot be asserted that 
RS(T) u APS(T) = C. Instead we have the theorem that RS(T) U APS(T) is 
congruent with C; this means that there cannot exist a complex number which 
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cannot belong to RS(T) U APS(T). APS(T) is real; this does not mean that 
every non-real number is regular. For z = x+iy, y #0, a necessary and 
sufficient condition for z to be regular is that the species of the points 
(f, (T-—x)/y)f) in Hx H, for f in H, is located in Hx H. For self-adjoint 
operators it is unnecessary to define the spectrum; it can be identified with 
APS. 

The representation of the resolvent by a Brouwer-Stieltjes integral is 
obtained by a double limit process. First we consider the subspace H,, span- 
ned by the first n basic vectors; let P,, be the projection on H,,andT,, = P,,TP,,. 
T,, can be represented in H,, by a Hermitian matrix (¢,;). We can find the 
characteristic values of this matrix, but not always the characteristic mani- 
folds. To overcome this difficulty, we approximate the ¢;; by rational num- 
bers #;; such that the matrix (¢;,) is hermitian and )'(t,;—t/;)?> < 27". The 
characteristic values of (¢;;) are algebraic numbers, so for any two of them 
it is decidable whether they are equal or apart, and the corresponding char- 
acteristic subspaces can be found. We add to these Hy with the character- 
istic value 0. Let T,, be the operator, determined by (¢;;) on H,, and 0 on 
H, then 


(Ta-2)f.9) =| (2 ACEO, 0). 


where E,(A) is defined in the well known manner. A passage to the limit gives 
+2 
((T-z)f,g) = tim | (A—z) 'd(E,(A)f, 9). 


3. Non-separable Hilbert space ([1]). No example of a non-separable Hil- 
bert space is known. The same sort of difficulties because of which the space 
of functions of calculable variation is not linear occur in the theory of al- 
most periodic functions. cos Ax is a.p. ifand only if A = 0Oor1 #0. 1fA #0 
and py # 0, then e'“* +e! is a.p. if and only if 2 = p or 1 # yu; this follows 
from |e'**+e'*|? = 2+2 cos(A—p)x. It follows that a trigonometric series 
y a,e'*** can only be a.p. if for every k, / it is decidable whether 2, = 4, or 
1, % A,; the sum of two series which satisfy this condition need not be a.p. 


The inner product of e'** and e'** is 
Ti . A 
lim =| ed : ut noe 
TP 65) 2T =F 1 if A = it 3 


so it is only defined in these two cases. 
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The mean value of an a.p. function f is always calculable, but that of 
f(x)e74* need not be calculable for every 1; moreover, the Fourier exponents 
need not be calculable. 

It seems that a general theory of a.p. functions is unfruitful. The theory 
can be restricted to Fourier series with mutually apart exponents, but even 
these do not constitute a Hilbert space. 

From the intuitionist point of view it is natural to consider Hilbert space 
with the binary fan BF as its index set. Every element of App is a function 
f with a domain D, ¢ BF and a range A, ¢ C (complex numbers), satis- 
fying the following conditions: 

1. for every k there is given a finite species Q, of mutually apart elements 

of D,; for every k, O, © Ops, 

2. for every k and m, Weo,. eo, \{@)- = 2ue 

3. If ae D, and a is apart from every element of Q,, then | f(a)|? < 2“. 
It is clear that ae D,—\), O, >/(a) = 0. Thus we may extend D, to 
\) O, Vv (BF—-\)J Q,) and put f(a) = 0 for ae BF—(|) Q,. 

Unfortunately, Hp, is not linear, because from /, g € Hp, it does not fol- 
low that f+g € Hg. In fact, we are not always able to find Q,(f+g) satis- 
fying the conditions 1, 2, 3 above. Also the inner product (f, g) is not always 
defined. 
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A THEORY OF CONSTRUCTIONS EQUIVALENT TO ARITHMETIC 


NICOLAS D. GOODMAN 


A theory of constructions is a type-free and logic-free theory directly about 
the rules and proofs which underlie constructive mathematics. The idea of 
such a theory as a basis for mathematics is implicit in many of the intuition- 
istic writings. For example, in Heyting [2], an informal theory of this sort 
is tacitly used for the interpretation of the logical connectives. The first 
attempt to formalize such a theory and to use it to give a semantic founda- 
tion for intuitionistic logic is in Kreisel [4]; the formalization is carried 
further in Kreisel [5]. Our purpose here is to continue this work by describ- 
ing a theory Z of constructions which is proof-theoretically equivalent to 
arithmetic. The proof of this equivalence and a discussion of its proof- 
theoretic applications will be postponed to a later publication. 

The results of this paper, in a somewhat different form, are part of the 
author’s doctoral dissertation at Stanford University, written under the 
direction of Professor Dana Scott. We wish to express here our deep grat- 
itude to Professor Scott for his patience, his sympathetic guidance, and his 
many stimulating suggestions. We also wish to express our indebtedness to 
Professor Georg Kreisel for several valuable conversations on the subject 
of this paper. 


1. The definition of intuitionistic implication. The major difference between 
intuitionistic and classical logic is that the classical mathematician thinks of 
himself as reasoning about an objective, external domain of entities. Every 
meaningful assertion about that domain is objectively either true or false. For 
the intuitionistic mathematician, on the other hand, it does not make sense 
to talk about the truth of a mathematical proposition independently of 
the question of whether or not one hasa proof of it. Classically, in order to 
explain the meaning of a proposition it suffices to give the conditions under 
which it is true. Intuitionistically, one must rather explain what it means to 
have a proof of the proposition. For example, if we already understand the 
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propositions %{ and %, then we explain the implication 2 —> % by stipulating 
that a proof of 2{ > 8% is to consist of a method leading from any proof of 
Y% to a proof of 8, together with a proof that the method always works. 

In the next several paragraphs we shall analyze the presuppositions of 
this definition in an effort to determine what our eventual theory of construc- 
tions must look like. 


2. Terms. Since the proof of an intuitionistic implication is essentially a 
function, it is clear that, in addition to terms denoting the proofs themselves, 
the theory of constructions must contain notations for various operations 
on proofs. Moreover, since functions can themselves be parts of proofs of 
complex assertions, it is technically convenient to work in a type-free theory 
in which rules and proofs are not distinguished. We shall imitate the com- 
binatory logicians and think of a function of two arguments, for example, 
as a function of one argument whose values are again functions of one ar- 
gument. We write the result of applying a to b as (ab) rather than as a(b). 
Then we omit parentheses by associating to the left, so that aby... b,_,, 
the result of applying a to the arguments by,..., b,44, is 


(Ce. Gb, eee 


We treat everything as a function by taking objects which are intuitively 
not functions to be totally undefined functions. 

We may now give a formal definition of our set of terms. We suppose given 
a list v9, 0}, ¥2,... of variables. We use x, y, Z, u, v, w to denote variables. 
We also suppose given a set of constants. Each variable and each constant is 
a term. If a and b are terms, then (ab) is a term. These are the only terms. 
We use a, b, ¢, D, e, J, g, 9 to denote terms. Every term is uniquely of the 
form aby... 6,~,, where a is a variable or a constant. A term containing 
no variables is closed. We indicate the substitution of a for x in an expres- 
sion by prefixing [a/x]. 


3. The combinators. The correctness of the laws of Heyting’s predicate 
calculus (HPC) should follow directly from the interpretation of the logical 
connectives. For example, that 2{ — Y is a correct logical principle follows 
from the triviality that the identity function, applied to any proof of Y, 
gives a proof of 2. In order to be able to formalize such arguments in our 
theory of constructions, we must have at least certain elementary closure 
conditions on the functions of the theory. The simplest requirement we can 
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make is that they be closed under explicit definition. To that end we adapt 
Curry’s familiar combinators K and S (see Curry and Feys [1]). In order 
to be able to think of the constants K and S as denoting functions, we modify 
them so as only to be defined when their arguments are defined. Specifically, 
Kxy has the value x, and Sxyz has the value xz(yz), whenever the latter is 
defined. It is well known that K and S suffice for the treatment of explicit 
definition. Below we shall often make use of lambda-abstraction, remembering 
that this notation is eliminable in favor of K and S by the methods of Curry 
and Feys [1]. 

The theory of constructive functions at the base of our theory of construc- 
tions will be very similar to a classical free-variable theory of Gédel numbers 
of partial recursive functions acting on each other. Indeed, we can make a 
diagonal argument to see that ours must also be a theory of partial functions. 
This is to be expected, since Church’s thesis is not obviously false. Specifical- 
ly, suppose we have a term j with the property that, if x is not zero, then fx 
is zero, and, if x is zero, then [x is not zero. Then consider the term 


(Ax.fx))(Ax.{@»)). 


If this term had a value, then it would have to be a fixed point of the function 
denoted by j, which is impossible. Thus the term must be undefined. 

To avoid using partial functions, one would have to think of a construc- 
tive function as a pair consisting of a rule and a proof that the rule is every- 
where defined. But this is a derived notion which should be defined, within 
the theory of constructions, in terms of the more elementary notions of rule 
and proof. 


4. Intensional identity. Just as in classical recursion theory there is an under- 
lying notion of computation, so we need a notion of reduction. Any closed 
term of our theory is of the form ab)... b,.,, where a is a constant de- 
noting a rule. In order to evaluate this term we must apply the rule denoted 
by a to the arguments, if any, denoted by Do,..., b,_,. To do that, we must 
first evaluate the terms Dy, ..., b,_,. The result of the application of a will, 
in general, be another term itself requiring evaluation. In this way the intend- 
ed interpretation of the constants of the theory will induce the relation of 
reduction on the closed terms. This relation will be deterministic, since we 
always reduce the arguments first and then carry out the uniquely determined 
operation required by the rule denoted by a. A term which cannot be 
further simplified by this reduction process, and therefore directly denotes 
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a well-determined construction, is reduced. Thus a closed term is defined 
if and only if it reduces to a reduced term. In general, of course, it is 
undecidable whether or not the reduction process applied to a given term ¢ 
terminates. 

Our notation will be so chosen that syntactically distinct reduced terms 
denote distinct constructions. Thus we may replace the intuitive relation of 
intensional identity between constructions by the formal relation of inten- 
sional identity between closed terms which holds just in case they reduce 
to the same reduced term. If a and b are arbitrary terms of our theory, 
we write a = 6 to mean that a and BD are defined and have intensionally 
identical values, where we think of free variables as ranging over the 
reduced terms. In particular, an equation a = a asserts that a is defined. 

When we come to the axiomatization of our theory 7, we shall be care- 
ful to give enough axioms that every closed term which is defined can be 
reduced within.7 to a reduced term. Therefore, as we shall show in another 
paper, the theory itself induces a unique minimal reduction relation on the 
closed terms which satisfies all the axioms of the theory. 

The relation of intensional identity is an undecidable, purely existential 
relation. If we know that a and 6 are defined, however, then it is decidable 
whether or not they are intensionally equal. We introduce a constant Q 
representing the predicate of intensional identity. 

Since the basic identity relation of our theory is intensional, it makes sense 
to introduce operators, which we shall call discriminators, whose function 
it is to analyze the structure of the reduced forms of the terms they apply to. 
Their importance is primarily technical, in that they enable us to formulate 
certain principles in a more perspicuous way. We shall need three discrimi- 
nators. The first of these, which we call 6, determines whether or not the 
reduced form of its argument ¢ is of the form (a, a2). The other two discri- 
minators, 6, and 6,, carry this analysis further. Specifically, in the above 
case, we take 6,;¢ = a; fori = 1, 2. 


5. Pairing. In §1 we explained that a proof of an implication is a pair con- 
sisting of a function and a proof. Thus our theory must contain pairing func- 
tions D, D,, D,. We let Dxy be the pair consisting of x and y, in that order. 
Moreover, for i = 1, 2, we let D(Dx,x,) = x;. If x is not a pair, then we 
let D,x = x. 

For the interpretation of arithmetic, it is convenient to think of every 
object as a sequence. Specifically, we think of each x as the (eventually 
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constant) sequence 
Di Xe), DDD x). 


Then, for each natural number n, we can construct a term C, such that, for 
any x, C,x is the nth component of x thought of as a sequence in this way. 
Also we can construct, for each », a term p, such that, for any x and y, 
Pnxy 1s such that 


CAp y=) 


and, if m 4 n, then 


Coney On 


Now suppose that we are giving a definition of the logical connectives for 
a language containing quantifiers and free variables. Then what we must 
define is not ‘y is a proof of 2° but rather ‘y is a proof that the sequence x 
satisfies 2. Thus to each formula 2 we will assign a term |2{| such that, 
intuitively, |2{|xy has the value true if and only if y proves that x satisfies 1. 
Suppose we already have the terms |%{| and |%|. Then, reformulating the de- 
finition in § I, it is clear that we should have 


[Of + 8B] = (Axy.D,y proves that, for any z, if |%M|xz has the value true, 
then |S|x(D,yz) has the value true). 


6. Truth-functions. In a classical truth definition the logical connectives are 
presupposed, not defined. The definition in § 5, on the other hand, is actually 
supposed to explain implication. If the definition is not to be circular, then 
the ‘if ..., then’ in the definition must be essentially simpler than the in- 
tuitionistic implication being defined. This is achieved by requiring that the 
proof predicates |2| and |%| be decidable, so that, even from an intuitionistic 
point of view, we can make unproblematical use of truth-functional connec- 
tives. Let us look at those connectives more closely. 

It is convenient, though arbitrary, to let the truth-value T be (Axy.x), and 
to let the truth-value L be (Axy.y). 

Looking at the definition of implication in § 5, we observe that, if z is not 
as a matter of fact a proof that x satisfies 2, then D, yz may be undefined, 
since the function D,y is only supposed to apply to arguments which 
are proofs of the relevant kind. Thus we only want to try to evaluate D, yz if 
|2{]xz = T. Thus we need a truth-functional implication between partial 
predicates of one argument which gives the value T if the antecedent has the 
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value L, even if the consequent is undefined. We explicitly define a function 
>, of three arguments u, v and z, writing ‘u >, v’ for ‘>, uv’, in such a 
way that ifuz = T, then the function applies v to z, and ifuz = 1, then it ap- 
plies (Aw.T) to z. Remembering how we defined T and 1, we put 


>, = (Auvz.uzv(Aw.T)z). 

More generally, we put 

yp = (Atv 9 4 2pa - UZoe ee Wa Woe 
Similarly, we let 

Op = (Auvzg i Zee ctl cq Wea ee ec 
and 

WW, = (uizgn Zeon 2g See een oe ee eee | qe ays 
We can now write the definition of implication as follows: 


| + SB] = (Axy.D,y proves that, for all z, 
[(12]x) >, (Az.[Blx(D2yz))]z = T). 


7. Sequents. Since the relation = is not decidable, the truth-functions are 
not quite enough to give us all the basic logical apparatus we need. Neverthe- 
less, since we are working with partial functions, and since it is not obvious 
how to formulate the correct intuitionistic logic of partial predicates, we wish 
to minimize the logical presuppositions of our theory. The notion we need is 
that of a sequent. 

We use A and I to denote finite sets of equations, where an equation is 
an expression of the form a = b. Then a sequent is an expression 


All @ = id, 


This sequent is to be interpreted as an intuitionistic implication: if all the 
equations in A hold, then a = b. The formulas of our theory are sequents. 
Notice that we only assume implication between purely existential assertions 
and, in particular, we have no iterated implications. 

Except for the specification of the remaining constants, this completes the 
description of the language of our formal system. We postpone writing down 
any axioms until after we have completed the discussion of the intended 
meanings of the constants. 
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8. The proof predicate. Looking at the definition of implication at the end 
of § 6, we see that if |9f + S|xy = T, then D,y is a proof of an elementary 
statement in AE form. We shall take such proofs as basic and will not at- 
tempt to analyze their internal structure. Thus it is natural to introduce a 
primitive decidable operator x such that zuv has the value T iff v is a proof 
that, for all z, uz = T. Then we can write the definition of implication as 


[> B] = (xy . a[((Mlx) >, (Az « [Blx(Dz yz)) (Di y)). 


The decidability of the basic proof predicate x seems to conflict with or- 
dinary mathematical experience. After all, we often think we have a proof 
of an assertion when, as a matter of fact, the argument we have in mind is 
still confused. We take this fact as evidence not of the undecidability of the 
proof predicate, but rather of a lack of clarity in the way the putative proof 
is presented. From an intuitionistic standpoint, the verbal argument we have 
in mind is at best a rough guide to the construction of the infinite canonical 
proof which, if fully grasped, could not be misleading. Thus we assume that 
a clearly given construction always either is or is not a proof of a given 
assertion. 


9. The self-referential paradox. There is an essential impredicativity in our 
definition of implication. For the assertion that |%f > 8|xy = T involves 
quantification over all proofs of 2{, including proofs which may themselves 
have been built up in some way from y. Unless something is done to moderate 
this impredicativity, it actually leads to paradox. As a first step, we construct 
a self-referential sentence which asserts of itself that it is unprovable. 
Ret 

g = (Ayz.S(Ky)(Kz)). 
Then g is a total function of two arguments with the property that, if 
gyzx is defined, then 

+ gyzx = S(Ky)(Kz)x. 
Therefore 

Qyzx = gyzx t qyzx = Kyx(Kzx), 
and so 
Qyzx = qyzx lt gyzx = yz. 

The assertion that w does not prove that 


bFyz=T (1) 
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can then be written as 


F (x(gyz) >, (KL))w = T. 
Let 
= (Ayz.n(n(gyz) > 1 (K1))(z2)). 


Then the equation }yz = T asserts that zz is defined and proves that (1) is 
unprovable. Thus to construct our self-referential assertion it suffices to 
find a term a such that we can prove that az = haz whenever either side is 
defined. We take 


a = (Az.(Ay.h(yy)z)(Ay-h(vy)z)). 


We can explain the lambda-abstraction here in sucha way that ais defined. 
Then we have intuitively that 


Faz=T (2) 


just in case zz is defined and is a proof that no x is a proof of (2). 
Now, if something is proved, it is a fortiori true. Hence 


n(gaz)x = Tk gazu = T, 
and so, as above, 
nm(gaz)x = Thkaz=T. 


By our choice of a, therefore, 


n(gaz)x = Tt x(x(gaz) >, (KL))(zz) =T, 
so that 


n(gaz)x = Tt (a(gaz) >, (K1L))w ST. 
Recalling the definition of >,, we have 
nm(gaz)x = Th KlwsT. 
Therefore, by the decidability of z, 


F n(gaz)x = 1. (3) 
It follows that 
F (x(gaz) >, (KL))x = T. (4) 
Thus we have specified, uniformly in z, a proof of (4) for all x. Hence there 
must be a function f such that, for all z, 


+ n(x(gaz) >, (KL))(fz) = T. (5) 
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In particular, 
+ n(x(gaf) >, (KL))(Gf) = T- 


That is, t haf = T, and sof af = T. For any w, therefore, 
F gafw = T, 

and so we can choose Bb with 
F x(gaf)b = T, 


which contradicts (3). 

A similar argument shows that if the starred theory in Kreisel [4] is modi- 
fied so as to permit arbitrary lambda-abstraction, the resulting theory is 
inconsistent. This fact was noticed independently by Kreisel and by the 
author. 


10. Stratification of the theory of constructions. There is a fundamental 
heuristic principle to the effect that clear ideas lead to consistent theories. 
What we have left vague above is how constructions are built up. The set- 
theoretic paradoxes are resolved by observing that sets must be sets of ob- 
jects already at hand. Similarly we suggest that proofs must be about ob- 
jects already constructed. Just as in Zermelo set theory there is an implicit 
cumulative theory of types, so we propose to formulate a theory of con- 
structions involving a cumulative theory of Jevels. At the bottom level 
we will have constructive rules operating on each other. Among these will 
be certain basic, purely finitistic rules such as the combinators. We assume 
that it is decidable whether a construction belongs to this basic level, and 
as usual with decidable species, we identify the level with the function 
which gives T or L according to whether its argument is or is not an element 
of the level. We introduce a constant B to denote this basic level. Given any 
level L, we suppose that we can extend L to a new level containing all the 
objects of L, all proofs about objects of L, and certain additional construc- 
tions to be described below. If a denotes L, then we let Ea denote the extend- 
ed level. We emphasize that this is not a stratification by logical type, but 
rather a stratification according to the subject matter of proofs. 

Applying an idea of Kreisel, we can think of the stratification slightly 
differently. If the operator x of constructive universal quantification 1s it- 
self to be a construction, then this quantification must only be over a domain 
which has been grasped as a totality. But we cannot grasp the whole of the 
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constructive universe, which is always only a potential totality. The only 
grasped domains which we shall consider here are those which are maximal 
in the sense that they include everything which is immediately understood 
when their elements are understood. For example, we could think of the spe- 
cies of natural numbers as a grasped domain. This domain is not maximal, 
however, since implicit in the species of natural numbers is the species of 
constructive numerical functions, and so on. In this paper we shall always 
use ‘grasped domain’ to mean a totality which is decidable, which is grasped 
as a whole, and which is maximal in this sense. Therefore the grasped do- 
mains are just our levels. 

We add a new constant G which, applied to any argument x, determines 
whether or not x is conceived as a grasped domain. Then we will have 
+ GB = T and 

Gx = Tt G(Ex) =T. 


From the present point of view the operator a of §8 no longer makes 
sense, since the only proofs we understand are proofs about the elements of 
a given grasped domain. Within this new framework, we must think of z 
as a function of three arguments, where mxyz = T means that x is a grasped 
domain containing y, and z is a proof that, for all win that domain, yw = T. 
Neither zx nor z will be in the domain x, but only in the extended domain 
Ex. The operator z itself will not be in any grasped domain. 


11. The reducibility operator. Suppose we have a grasped domain a. Then 
we wish a to be as self-contained as possible. Therefore, if we have any rule z, 
then we suppose that we can find a rule Faz in the domain a with the proper- 
ty that, if x isin a and zx is defined and in a, then 


+ Fazx = zx. 


We do not care about the values of Faz for arguments x such that x is not in 
a, such that zx is undefined, or such that zx is defined but not ina. 
Suppose Z is actually not in the domain a. Then we shall not assume that 
if Fazx is defined, with x in a, then the value of Fazx is just zx. For, this prin- 
ciple would imply that z is somehow involved in the definition of Faz, con- 
trary to our conception of the levels as built up from below. Instead, we sup- 
pose that Fa, applied to z, chooses some rule already present in a which, 
extensionally, has enough of the right values to serve as a representative 
for z. The operator Fa cannot use the fact that z is undefined at an argu- 
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ment x without looking at the meaning of z and therefore giving a function 
as value which is itself of level higher than that of a. 

The effect of the reducibility operator F is similar to the effect of an 
impredicative comprehension principle. Suppose, for example, that we have 
proved 


AnV ma, m), (6) 
where # and m are numerical. Then that proof gives us a function f such that 
AnX(n, fn). 


The description of f may involve objects of very high level. For example, the 
proof of (6) may be by modus ponens, so that f may depend on understanding 
the proof of a complex auxiliary proposition. However, since n and m are 
certainly in the basic domain B, we also have 


A nX(n, FBfn), 
and FBf is itself in B. 

The introduction of the reducibility operator F is made necessary by the 
impredicative character of intuitionistic implication. It seems to us essential 
to the intuitionistic position that given a fixed assertion 2{ about a well- 
defined domain, there is always an a priori upper bound to the complexity 
of possible proofs of Wf. In case is an implication, this principle already 
guarantees the existence of some sort of reducibility operator. 


12. Proofs. Now suppose is an element of the grasped domain a. Suppose 
we have a formal proof in our theory of the assertion that, for any x in a, 
fx = T. Then we /let af denote an intuitive proof of the assertion. We take 
jaf to be in the extended domain Ea. Since the formal proof may involve 
lemmas about objects of higher level, the introduction of the operator % 
involves another impredicativity of essentially the same nature as that in- 
volved in the introduction of F. Intuitively, “af denotes an infinite canon- 
ical proof of the assertion. This canonical proof depends only on a and f 
and not on the structure of the formal proof, which can be thought of as an 
indication of how to construct the infinite intuitive proof “af. If we have 
no formal proof of the assertion, then, in general, we are not able to construct 
the intuitive proof, and Zaf is undefined. 


13. Avoiding the self-referential paradox. We have now completed the 
description of the language of our theory 7 of constructions. The constants 
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we Shall use are K, S, D, D,, Dz, 6, 5,,62, Q, E, B, G, nx, F and #. The set of 
terms is then defined as in § 2. Sequents are defined as in § 7. 

Before beginning the axiomatization of 7, let us consider how the para- 
dox of § 9 is now avoided. There are several ways in which one might hope 
to reconstruct the argument leading to the contradiction. The most naive 
proposal might be to choose a so that 


ta = (dz.n(Ec)(nce(gaz) >, (K1L))(zz)), 


where c is a grasped domain, and where we write ‘Ec’ rather than ‘c’ as the 

scope of the left-most application of x since we can only correctly apply 

mc to a rule in the domain c. But in this case the step from (4) to (5) is un- 

justified because a itself is only known to be in the domain Ec, not inc. 
Alternatively, we might choose a so that 


t a = (Az.n(Ec)(ne(g(Fea)z) >, (KL))(zz)). 
But then, at the beginning of the argument, we only obtain 
ne(q(Fea)z)x = Tt Feaz = T, 
which is not enough to give 
ne(q(Fea)z)x = Thkaz=T 


even with the additional premise - cz = T. 


14. Axioms about consequence and identity. We begin with the rules of weak- 
ening and cut. 
Il. If dt a=), then A,c=dDta= Bb. 
2,.1f4,a=brkcec=dandAta=b,thenAtc=dD. 
Next we have substitution for free variables, with an additional premise to 
guarantee that the term being substituted is defined. 
I3. Ifdta=s=b,thenc =, [c/x]A+ [c/x]a = [c/x]b. 
We have reflexivity, symmetry, and substitutivity of intensional identity. 
14. If ais a variable or a constant, then + a = a. 
S.a=btb=a. 
6. [a/x]b = c, D = ak [d/x]b =c. 
We can only apply a function to an argument if both are defined. 
I7. ab =abta=a. 
8. ab=abtb=b. 
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As applications of the rules of I, the following are all provable: 
(i) a=bra=b. 
fii). a= bb=Sctazeec. 
(iii). a= brka=a. 

We do not, in general, have 


a=b,c=dDtac= bo, 


since well-defined functions and arguments may lead to undefined function 
values. 


15. Axioms for the combinators and pairing. The basic axioms for K, S, D, 
D, and D, are as follows: 
ll. | Kxy = x. 
2. Sxyz = Sxyz Sxyz = xz(yz). 
3. xz(yz) = xz(yz) + Sxyz = Sxyz. 


4. | Sxy = Sxy. 
pee D(x) x5) 2, 
6. | D;x = D;x. 


We also need axioms asserting that if x is not a pair, then + D;x = x, but 
we are not yet in a position to formulate them. 

The above axioms suffice for the treatment of explicit definition and se- 
quences as in §§ 3, 5, and 6. 


16. Axioms for the discriminators. We first wish to assert that the operator 
6 represents a decidable predicate. Let us formulate what this means in 
general. We say that a term ais a decidable predicate of n arguments iff when- 
ever we have 

AOD were = bc =p 
and 

A,ab)...b,-; =LFc=bd, 
we also have 

Amine sneer en Dib FOr Ds 


If a is a decidable predicate of n arguments, then 
fe le een pe — ON ye a 


III 1. 6 is a decidable predicate of one argument. 
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Next, 6;x is defined only if the reduced form of x is composite. 
M26. =O x rox = 7, 
3 ee = Leas = Oe 
No constant is composite. 
Il14. Suppose a is a constant. Then | 6a = L. 
Finally we need an axiom guaranteeing that 6; has the right values. 
III5. Suppose ais K, S, D, Q, E, 2, F, ~, Sy, Dy, ny, Fy or xy. Then 


ax = axt 6,(ax) =a 
and 
ax = axt 6,(ax) = x. 


17. Axioms for the predicate of identity. The constant Q is to represent 
the decidable predicate of intensional identity. 
IV1. Q is a decidable predicate of two arguments. 
2. +t Oxx =T. 
3, Oxy =Ttx=y. 
4. If a and b are distinct constants, then | Qab = L. 
We are now ready to write down the axioms which assert that D; has the 
right values on arguments which are not pairs. 
IV 3. 0x =e) eee 
6. 6(6,x) =Lt D;x =x. 
7. QD(6,(6,x)) = Lt Dix = x. 
As an example of the application of Q, we observe that it enables us to dis- 
tinguish pairs from objects which are not pairs without using the discrimi- 
nators. For x is a pair just in case x is the pair of its components. Thus let 


P = (ax.Qx(D(D, x)(D>x))). 


Then P is a decidable predicate of one argument which is such that, intui- 
tively, | Px = T if and only if x is a pair. 


18. The rule of induction. In order to interpret arithmetic, we need a prin- 
ciple justifying arguments by induction over the natural numbers. In the 
context of an intensional theory such as ours, it is natural to add a principle 
of induction on the way in which the reduced terms are built up. This can be 
formulated as follows: 
V. Suppose x does not occur in A or a. Suppose 
(i). 4, 6x = 05 axl 
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(i), Al a(o,x) = 1, do x) = Tea =7. 
Then A+ ax =T. 
Now we assign a numeral n to each natural number n as follows: 
(0 =k 
(ii). a+1 = DOn. 
A reduced term ais a numeral if it is 0 or if it is a pair whose first component 
is 0 and whose second component is a numeral. Then, imitating this defini- 
tion, it is not difficult to construct a closed term N, without using the dis- 
criminators, such that: 
(i). N is a decidable predicate of one argument; 
(ii). for each n we have + Na = T; 
and 
(iii). the following special rule of induction is derivable: Suppose x does 
not occur in A or a. Suppose 
(2) en a0 sie 
(b). 4, Nx = T, ax = Tt a(D0x) = T. 
Then 4, Nx =Thkaxe=T. 
For each primitive recursive function H we can construct a closed term 
fy such that, using induction, fy, can be proved to satisfy the defining equa- 
tions for H. 


19. Axioms concerning grasped domains 
VII. Gis a decidable predicate of one argument. 
2. -GB=T. 
3. If Ga = T, then + G(£a) =T. 
For each natural number p, we define the pth level L, as follows: 
GO). Lay Ss 
(ijt — LE. 
The only reduced terms of the theory 7 which denote grasped domains are 
the terms L,. This will not, however, necessarily remain true when TJ is 
extended. Thus instead of asserting directly that these are the only grasped 
domains, we add axioms which assert, for each term of Y which is not an 
L,> that it is not a grasped domain. 
VI4. Suppose ais a constant of 7 other than B. Then Ga = 1. 
5. Suppose a is a constant of 7 other than £. Then 


One =r Gx = 1, 
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6. Suppose a is any constant of 7. Then 
6,(6,x) =at Gx =1. 


The operator E only applies to grasped domains. 
VI7. Ex = Ext Gx =T. 
Any grasped domain is a decidable predicate of one argument. 
VI8. Suppose 
(i). 4,ab=Thke=d. 
(ii). 4,ab=ihc=bd. 
Then 4,Ga=T,b=bFc=bD. 
Any element of a grasped domain is also an element of the extended domain. 
VI9. Gx = T,xy = Tk Exy =T. 
A grasped domain is closed under any function belonging to that domain. 
VI10. Gx = T,xy = T,xz =T, yz = yzt x(yz) =T. 
Certain elementary rules belong to the basic domain. 
VI11. Suppose a is K, S, D, D,, D, or Q. Then + Ba =T. 
There are other operators which, because of their great generality, belong 
to no grasped domain. 
VI12. Suppose a is 6, 6,, 62, E, G, x, For ”. Then 


Gx =TKFxa=l. 


Every level is an element of itself, but not of the previous level. 
VI13. Gx = Tkxx ST. 
14. Gx = Tk x(Ex) = 1. 
In certain cases one can argue from the level of a reduced term to the level 
of its components. 
VI15. Suppose a is K, S, D, Q, Sy or Dy. Then 


Gx = T,6,z =a,xz = Tt x(6,z) =T. 
16. Suppose a is S or D. Then 
Gx = T, x(ayz) = Tk xy ST. 
20. Axioms about the reducibility operator. We begin with two axioms about 
the level of a term of the form Fa. 
VIII. Gx = Th x(Fx) = T. 


2. Gx = T,6,;y = F,xy = Tk x(6,y) ST. 
Of course, Fx is only defined if x is a grasped domain. 
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VIL3. Fx = Fx + Gx ST. 
Next we have two axioms about the level of terms of the form Fab. 
VII4. Gx = Tt x(Fxz) = T 
5. Gx = T, x(Fyz) = Th xy ST. 
Finally we have the characteristic axiom about the values of functions of 
the form Fab. 
VII6. Gx = T, xy = T, x(zy) = Th Fxzy = zy. 


21. Axioms about proof. Again, we begin with axioms about the levels of 
certain constructions. 
VIII1. Suppose a is x or %. Then 


Gx = Tt Ex(ax) = 
and 
Gx=Trx(ax)=HL 


2. Suppose a is x or #. Then 
ax =axtGxsT. 
3. Suppose a is x or #. Then 


Gx = T, axz = axzt Ex(axz) = T 
and 
Gx = T, x(ayz) = Tt x(ay) = T 


4. Suppose a is x or £. Then 
axz = axztxz=T. 


5. mxzy = Tt Exy =T. 
If a is a grasped domain and b is an element of that domain, then zab is a 
decidable predicate of one argument. 
VIII6. Suppose 
(i). 4, mabe = TE bd =e. 
(ii). 4, mabe = Ltd =e. 
Then 4,Ga=T,ab=T,c=ctd=e. 
The reflection principle asserts that anything provable is true. 
VII. axzy = T,xw=Thkaweswt. 
The rule of proof asserts that formal derivations correspond to correct in- 
tuitive proofs. 
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VIII8. Suppose x does not occur in A, a or b. Suppose 


4,ax =Tt bx =T. 
Then 
A, Ga =T, ab = Tt zab(fab) = T. 


A proof of the form /ab is only a proof of one assertion. 
VIIL 9. axz(f~yw) = Thy =x. 
10. mxz(~xw) = Thz=w. 
Finally, we need three axioms asserting that various objects are not proofs. 
VIJI11. Suppose a is any constant of 7. Then 


Gx =T,xz=Thrxza=l. 
12. Suppose a is any constant of 7. Then 
Gx=T7,xz=T,6,;y =at axzy = 1. 
13. Suppose a is any constant of 7 other than £. Then 
Gx =T,xz =T, 6,(6,y) = at axzy = 1. 


This completes the description of the theory 7 of constructions. 


22. The interpretation of arithmetic. We suppose given a formulation of 
Heyting’s arithmetic (HA). We shall not describe this theory, but refer the 
reader to Kleene [3] as an example. Then, to each formula 2 of HA, we 
wish to assign a closed term || of J such that, for any sequence x and any 
y, |Uxy = T if and only if y is a proof that x satisfies 2[. First, to each term 
t of HA we assign a closed term |t] of Z such that, for any sequence x, 
|t]x is the value of t at x. The definition is by recursion on the structure of f: 

2) Oe 

2. |0| = (Ax.0). 

3. |f| = Ax.D0(li]x)). 

4. Suppose # is a primitive recursive function of k arguments introduced 
by certain defining equations. Let fy be the term introduced in § 18 to prov- 
ably satisfy these equations. Then 


Elian « - oonemey esate pp (lisfba) - oo Ucabel. 


Now, the level of a term |2| will depend on the depth of nesting of quanti- 
fiers and implications in %{. Accordingly, to each formula 2 we assign a 
depth, d(2l), by recursion on Y: 
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ena Set) — 0: 

2. da B) = d(Wv B) = max {d(W), d(B)}. 

3. d(% > B) = 1+ max {d(Q), d(B)}. 

4, d( Vv, 2) = d(H). 

5. d(Av, MX) = 1+d(2). 
We can now define the term || by recursion on the structure of Y: 

First, y proves that x satisfies an equation 8 = ft just in case y is of level 
zero and [3|x = |t|x: 


[3 = t| = (xy . By Ao Q(13|x)(It|x)). 


The construction y proves that x satisfies 2{ A 6 just in case the first compo- 
nent of y proves that x satisfies 9{ and the second component of y proves that 
x satisfies 8: 

[AA Bl = (Axy . [W]x(D1 y) Ao |Blx(D2 y)). 


The construction y proves that x satisfies 9 v 6 just in case either y proves 
that x satisfies 2{ or y proves that x satisfies 8: 


[Av Bl = (Axy . [xy Vo [Blxy). 


Suppose p = d(% > %). Then d() < p, and so any proof that x satisfies 
% has level < p. Therefore, in defining |2{ > 9] it suffices to quantify over 
jie 


[> Bl = (axy . aL L(x) >; (Az . [Blx(D2 yz) (D1 y)).- 


The construction y proves that x satisfies V v,, 2{ just in case the first compo- 
nent of y is a natural number and the second component of y proves that 
x, with the first component of y in the nth place, satisfies 1: 


[Vv, WU] = xy. N(D,y)) 92 (xy . [M(e,x(D1 y))(D2 y)). 


Finally, suppose d( A v,2f) = p. Then y proves that x satisfies /\v, 2 just 
in case the first component of y proves that, for any natural number z, the 
second component of y applied to z proves that x, with z in the nth place, 
satisfies 2: 


[A v, | = (Axy.aL,[N >, (Az. [Ul(e,xz)(D2 yz) (D; y)). 


Observe that the constant zero sequence is represented by 0. Then our main 
result about the interpretation of HA is this 


THEOREM. Suppose 2 is a sentence of HA. Then YU is a theorem of HA if and 
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only if there is a term a such that we have inT that 
F {20a = T. 


This theorem establishes the consistency of 7 relative to HA. 
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CHURCH’S THESIS: A KIND OF REDUCIBILITY AXIOM FOR 
CONSTRUCTIVE MATHEMATICS'* 


G. KREISEL* 


The purpose of this lecture is to expand the remark ([21] p. 146, 2.76) 
relating Church’s thesis in constructive mathematics to the ‘hypothesis’ 
V = L in set theory (which, by [4] p. 147, implies a natural form of the 
axiom of reducibility). Some problems suggested by the remark are solved 
in the Technical Notes (e.g. I and IT) below. Also (and, to me, this is perhaps 
most important) the body of open problems is coherent, and so there is at 
least a chance of some decisive results in this area. 

Two topics treated here may be of independent interest. A development 
of the theory of abstract constructions by use of the notion of graspable 
domain (§ 5) and a connection between Scott’s model [36] and the theory of 
lawless sequences (Technical Note III). 


Introduction: abstract impredicative and (more or less) explicitly defined 
notions. We consider principally: 

(i). The abstract notion of arbitrary subset and the power set operation 
(collecting all subsets of a given set); correspondingly, the notion of con- 
structible set, obtained by iterating the process of collecting (only) those 
subsets which are definable by means of formulae in the language of set 
theory, using names for elements of the given set. 

(ii). The (abstract) notion of constructive function (with integral ar- 
guments and values) as understood in intuitionistic mathematics, and ex- 
plained in [5]; correspondingly, the notion of recursive function, defined by 
means of recursion equations. 

The clause ‘more or less (explicitly defined)’ is needed because (i) the ab- 
stract notion of set (theoretic ordinal) is used for formulating the iteration 
process and (ii) the notion of constructive function is involved in the mean- 


t Concluding address. 
* Preparation of this paper was partially supported by grant DA-ARO-31-124-G 985. 
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ing of the quantifier combination V3 in Vndm T(e, n,m) (for Kleene’s T- 
predicate) used to define the notion of recursion equation. Only for certain 
segments of the constructible hierarchy and certain subclasses of the class 
of recursive functions are the relevant definitions independent of the cor- 
responding abstract notions (cf. my other lecture [26]). 

Undoubtedly the natural way, incidentally corresponding roughly to 
the historical development, of looking at the two sets of notions, is this: 

First we have the abstract notions and their basic properties (axioms), e.g. 
the axioms of Zermelo, Mahlo, Fraenkel in (i), Heyting’s arithmetic and 
extensions in (ii). These axioms are recognized by inspection or reflection. 
Afterwards defined notions are introduced. Those mentioned above (happen 
to) satisfy the axioms originally found for the corresponding abstract no- 
tions. This fact was not immediate on inspection, but had to be established 
by a detailed argument, by Gédel for (i), by Kleene for (ii) using the notion 
of recursive realizability.’ 

Note in passing that for mathematical practice too the abstract notions 
are primary. Thus mathematicians speak freely of sets, but get bored or 
confused by explanations of first order definability, let alone of its iteration. 
In constructive mathematics, Bishop’s book [1] owes much of its appeal (and 
cogency) to the fact that he does not write out recursion equations but works 
with the primitive notion of constructive function; for more technical points 
see also [24] p. 233. Finally, as a related and more elementary illustration 
([21] p. 103, 1. 42), the natural way of looking at the Euclidean plane is not 
to introduce coordinates (names for points) at all, let alone algebraic ones 
for points obtained by explicit constructions; their place is in refinements. 

Probably the first natural /ogical problem is to look for properties which 
distinguish between the abstract and corresponding explicitly defined notions. 
In set theory this kind of question was suggested by Gédel [6] and first 
carried out (by use of axioms about measurable cardinals) in Scott [35]. 
Another, quite different, attempt is given in a letter (1963) of Myhill quoted 
in [28] (by use of axioms for random reals). From the present point of 
view it is equally natural to try and refute 


VfseVnip[ Tle, n, p)afn = U(p)] 


(for constructive functions f, numbers e, n, p, Kleene’s T and U) which we 
shall here call Church’s thesis.2? One expects to use recondite properties of 
the basic constructive notions but, if possible, less problematic ones than 
those used by Scott and Myhill above. (Evidently the project makes sense 
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only if one understands or, as one sometimes says, ‘accepts’ the basic notions; 
it would be circular if one thinks of the explicitly defined ones as replacing 
the others.) 

Let me end this introduction with a word on the difficulties of the pro- 
ject. Subjectively, we live at present in a logical tradition which insists on 
algebraization or arithmetization (in place of the geometric way of looking 
at things). Not only (as in Euclid’s axioms) do we list explicit properties 
of notions, but we try to list the means of generating the objects studied. As 
an illustration, contrast the explicit definition of the property of continuity 
with the (algebraic) restriction to e.g. polynomials built up by addition and 
multiplication. In its extreme form this tradition requires the kind of ex- 
plicit description mentioned as a prerequisite of precision itself (cf. [23] con- 
cerning formal and informal rigour). Whatever its merits or defects the 
tradition exists and presents a psychological obstacle to the study of ab- 
stract notions. For this reason, I give elementary examples in § 1 (and con- 
sider them basic.) 

Objectively, the nature of the abstract notions is problematic; indeed 
this is perhaps their principal interest for the logician (though of course 
not for the working mathematician). Evidently the analysis of the notions 
is the ultimate aim of our research. But we should be able to say at the start, 
at least roughly, what kind of entities these abstract objects are. This is 
not the place to go into the notion of subset.* But what are the constructive 
functions or rules, and, more generally, the constructive proofs for which we 
wish to set up a theory? Speaking for myself, I think their usual characteri- 
zation as mental objects (in contrast to their spatio-temporal or linguistic 
representations) is inadequate in an essential respect. The rules and proofs 
are of course involved in the mental (psychological) experience of mathe- 
matical activity. But the theories we try to set up are not about this ex- 
perience as it presents itself to us, they concern its logically significant as- 
pects. 

Before mechanically dismissing this distinction as ‘vague’ or ‘circular’ 
(cf. note 4 on p. 143 for a sermon) let us pursue it, and compare our pro- 
blems with those arising in mechanics, say. The objects considered in this 
science are not the (material) bodies of physical experience as they present 
themselves to our senses. To apply mechanics we have to determine the mecha- 
nically significant data in an empirical situation (mass of the body rather 
than, say, its colour, electric charges if there is electric interaction etc. ). Indeed, 
we need the theoretical notions to tell us which data are significant, but we 
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do not have a recipe for stepping from crude physical experience to those 
data. One last and highly relevant historical point. The notions and laws of 
mechanics (e.g. the equations for incompressible fluids) were derived from 
general qualitative experience, not from delicate measurements which, in- 
cidentally, can often be stated only in terms of the theoretical notions. 

In the last analysis (cf. [21], p. 191, 5.422) the interest in the general 
theory of constructivity is based on the conviction that the (mental) ex- 
perience of mathematical activity is susceptible to a similar kind of theory 
as we have for (familiar) mechanical phenomena. We can’t be sure that 
the conviction is well founded. But it’s a sensible idea, and, as we know, 
sensible physical theories, even when false, have led to interesting formal 
mathematical problems and results. This certainly seems to carry over to 
our project. 


1. Basic example: constructive (prima facie) non-mechanical rules for 
number theoretic functions (cf. [21], p. 131, 2.35). We consider a formal 
system such as Heyting’s arithmetic HA which we have recognized to be 
constructively valid. Thus to each formal derivation (with Gédel number) 
n corresponds a constructive proof, say p,. The rule f is given by cases: 

Case 1. If m is not a derivation of any closed formula of the form 3x2 
then fn = 0. Otherwise 

Case 2.1. fn = 0 if p, does not provide a specific numerical instance 
satisfying 2, e.g. if 3x has been inferred from Vx; 

Case 2.2. fn = x+1 if x is the number verifying 2{ which is provided 
by p,. 

Clearly, no Turing machine will react to this rule as it stands. But let’s 
look at it before calling it vague*. As a first test of understanding we show: 

f is not equivalent to the following (mechanical) rule f’ (Recall that 
(HA | 4x2) = dx + Wx where x denotes the xth numeral.): 

Case 1’. If n is not a derivation (in HA) of any closed formula 4x then 
f'n = 0. Otherwise 

Case 2’. f'n = x+1 where x is the argument of 9% in the shortest deriva- 
tion (in HA) of a formula of the form x. . 

The rules are clearly different since case 2.1 has no analogue here at all. 
They are not even equivalent. For let 8 and © be two closed theorems of 
HA such that © has a much shorter derivation than %, and let 2 be 


Ca 0A DV ee) 
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Let n consist of a derivation of 8, followed by the inference to 4x2{. Then 
al bub e—— 2 

We can go a little farther. Consider an enumeration %,, in a system S of 
arithmetic properties, and let (m,n, y) mean: 


If 7 is a derivation in S of 4x, then YU, y. 


If S is constructively valid, we have VmVndy (m,n, y) and hence 
hVmVnR(m, n, h(n)). But in general (in contrast to HA) we cannot expect 


(Sole Se og) =e ea aad lage a) 


for example, if (i) the closed formula 4x%x is constructively valid, (ii) for 
no numeral x is 1x%x > Wx derivable in HA, and (iii) S = HA vu {Axx}. 
In that case the only definition of 4 that we have goes via p,; while, for 
S = HA, our f’ above will (at least) be a solution of 


YmVn R(m, n, h(n)). 


REMARK. It is almost banal that we understand non-mechanical rules; 
on the contrary too detailed, that is ‘too’ mechanical rules only confuse 
the human computer. The point of considering the specific rule f above 
is to have precise questions in a familiar context. 


2. Basic example: formal problems. Having understood the rule f, we ask 

Problem I. Is f equivalent to some mechanical rule? 

Clearly the problem has the same character as any other problem of informal 
mathematics: we look for minimal assumptions about p,, as evident as 
possible, which are sufficient to settle the question. However, if preferred, 
there is a ‘corresponding’ purely formal problem. 

Recall that each of the following syntactic studies of HA assigns a specific 
term t, possibly containing free variables (corresponding to case 2.1), 
to any formal derivation of 4x: 

Kleene’s various recursive realizability interpretations ([{14]-[16]); 
Gédel’s functional interpretation [5]; its variant ({17], p. 112, 3.52); 
Goodman [8]; cut elimination by use of infinite proof figures ([34] as 
extended in [21], p. 164, 3.332 *). Let f, be the mechanical rule which the 
interpretation ¥% assigns to derivations in HA. 

Problem 2. Are the functions fy for the interpretations ¥ above, equiv- 
alent? (‘Equivalent’ either in the sense that the terms assigned to n by the 
f, denote definitionally® or extensionally equal objects.) 
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Problem 3. Which, if any, of the f, are equivalent to the non-mechanical 
rule f above? 

Problem 2 belongs to traditional proof theory of intuitionistic arithmetic, 
a subject very much in need of less crude problems than the hackneyed 
question: does + 4x2 imply 4x | 2x? Ad problem 3, note that t assigned by 
a given ¥% ‘satisfies’ 2{ in the sense of .%, not always for the intended inter- 
pretation. 


3. Church’s thesis: generalities; incompleteness of Heyting’s predicate cal- 
culus. For systems not containing variables for constructive (number 
theoretic) functions the axiom on p. 122 is replaced by 


VX {¥x4y(<x, y> € X) > FeVxdz[T(e, x, z) A <x, U(z)> € X]}, 
and for systems not containing variables X for species, by the schema 
Vxdy R(x, y) > deVxdz[T(e, x, z)A R(x, U(z))] (*) 


for all definable relations {t not containing free variables (parameters) for 
incompletely defined objects such as free choice sequences. 

Church’s rule is, by definition, the inference of the conclusion in (*) 
when the premise has been derived. 

As observed in [19] (p. 140, 1. 11-14, for 2 of theorem 3, p. 150) Church’s 
thesis implies incompleteness of Heyting’s predicate calculus, and this is 
sharpened in the Technical Note I below.’ Regarding Church’s thesis as 
neither plausible nor refuted, we can say that the notion of constructive 
validity of first order formulae depends on problematic properties of the 
basic notion of constructive function; like second order validity ([23], p. 
157), and unlike first order validity (cf. p. 144, note 7) in the classical case. 

The following question asks for an analogue to Skolem-Loewenheim: 

Problem 4. Is there a definition (in the theory of species of natural numbers 
or even in arithmetic) of the species of constructively valid formulae in the 
language of predicate calculus (without having to decide Church’s thesis)? 

But Church’s thesis would of course imply that the species so defined is 
not the set of theorems in Heyting’s system. (Note that there is no solution of 
problem 4 by a constructively enumerated species. ) 


4. Church’s thesis and Church’s rule; consistency and conservative extension 
problems. Evidently, given any formal system containing the notation 
needed for formulating Church’s thesis (or its variants in §3), it makes 


B VIII CHURCH’S THESIS 127 


sense to ask whether the addition of Church’s thesis or rule is consistent, 
or whether the system is closed under the rule®; more generally, whether the 
addition is conservative for suitable classes of formulae (corresponding to 
[21], p. 106, 1. 535, for the addition of V = L). 

Good candidates for such classes of formulae are usually arithmetic 
formulae in the fragment (—, A, V) or V5 formulae. But perhaps the subject 
is ripe for a more general treatment (possibly simplifying [15] a bit): 

Problem 5. Are there simple syntactic conditions (on axioms) which ensure 
that the axioms, added to intuitionistic predicate calculus, are (i) consistent 
with Church's thesis or (ii) closed under Church's rule? 

But, for immediate progress, it is probably more useful to indicate 
particular systems that present open problems. 

(a). Theory of species of natural numbers (like classical analysis with the 
comprehension axiom and the axiom of dependent choices, but intuitionistic 
instead of classical predicate logic). In Technical Note II I give a sketch 
of a proof of closure under Church’s rule (as I conjectured [21], p. 351). 
We have the following question (but cf. the postscript on p. 147): 

Problem 6. For what classes of formulae (if any, e.g. {0=1}) is the addition 
of Church’s thesis to the theory of species conservative? 

For a justification of the theory of species see the end of § 5 below. But 
problem 6 is of interest even for those who (have not thought out the 
justification, or who simply) are primarily interested in a narrower area of 
constructive mathematics. Various formal theories of (possibly iterated) 
inductive definitions or, equivalently, so-called higher number classes can 
obviously be modelled in the theory of species, and so the consistency of 
Church’s thesis with the latter covers also those systems. (Naturally a 
separate argument is needed to treat the rule above; cf. end of Note II.) 

(b). Theories of choice sequences. For a brief summary see Technical 
Note IV. For all such theories, or better for all notions of choice sequence 
a considered, we have 


— VadeVx3z[T(e, x, a)Aa(x) = U(z)]. 


This is natural. For instance, for dice a (or lawless sequences) you don’t 
expect to prove that successive values of « will follow a recursive, or for 
that matter, any law. Nor will the thinking (freely creating) subject 
convince himself that his (mathematical) behaviour is subject to such a 
law! 

What is more reasonable to assume, is the consistency of the schema (*) 
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in §3 for relations defined by use of quantifiers for choice sequences. 
The bulk of the theories (cf. Technical Note IV) are reduced to the particular 
theory IDK in such a way that the consistency of (*) with IDK implies its 
consistency with those other theories. I have convinced myself that Kleene’s 
original recursive realizability interpretation extends to IDK (§1 of [24]) 
by use of the trick [24], p. 235, 1. 12. (Added in proof: the details are now in 
[43], 3.7.2.) But there is room for a more complete solution of 

Problem 7. For what class of formulae is the addition of Church's thesis 
to IDK conservative?? 

(c). Kripke’s schema. The schema is justified by reference to the 
‘thinking subject’ or, more objectively, to the analysis of mathematics into 
@ stages ([23], p. 179, 1.—7); but it is stated in the ‘usual’ language of 
intuitionistic analysis. For species ¥ and constructive functions f (not only 
choice sequences) we have a ‘strong’ form (KS*) 


VXIfVnLX(n)] — Im(f(m) = n)] 


(proved consistent by Scott [36], with various basic axioms, cf. Technical 
Note IV) and the ‘weak’ form (KS) 


VX3fVn([am(f(m) = n) > X(n))A[— am(f(m) = n) > A X(n)) 


(which follows from the axioms [23], p. 159-160). Even (KS~) is incon- 
sistent with Church’s axiom VfseVn3p [T(e, n, p)A U(p) = f(n)].*° 

However, I do not know the answer to 

Problem 8. For §R in the (restricted) language of analysis (not containing 
the primitive +, of [23]), is (KS), together with the axioms of Technical 
Note IV, consistent with Church’s schema, and is the system closed under 
Church’s rule? 

Discussion. The assumptions used in deriving KS~, namely thinking of 
the body of mathematical evidence as arranged in an w order, seem arbitrary 
(though not absurd) if, as in the theory of ordinals, one also thinks of 
individual proofs as consisting of a transfinite sequence of steps ([3], 
footnote 8). Therefore the inconsistency of (KS) with Church’s thesis does 
not, I think, refute the latter conclusively. In any case it leaves open the 
question whether constructive rules f defined without reference to such an 
hypothetical w-order of mathematical proofs are equivalent to recursive 
functions. We therefore turn now to an area of constructive mathematics 
which does not assume such an order, yet looks promising, that is, it looks 
abstract enough not to be reducible to mechanical operations. 
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5. Abstract constructions and proofs. See for reference [21], pp. 125-127 
or Troelstra’s course at the present conference and Goodman’s lecture 
[8].1? It is useful to think of the latter as a setlike, of the former as a classlike 
theory, where constructions correspond to sets, notions to classes (p. 126).‘? 
For the present purpose it seems slightly more convenient to use the classlike 
formulation. Let us scrutinize a bit the basic relation: 

For some given notion a, the construction (more precisely, judgement) 
¢ proves ad for variable d. 

For the discussion below it is quite sufficient to consider a particular 
notion « which we understand. More generally, there is nothing to force 
us to look at any notion « or any object d except after we have convinced 
ourselves that everything in sight makes sense, provided we have also 
convinced ourselves that the closure conditions (axioms) explicitly formu- 
lated are satisfied. After all, this freedom is one of the principal virtues of 
quantifier-free systems such as theories of constructions. 

The principal issue is this: 

If we think of the variable d as ranging over the, so to speak, absolutely 
unattained universe of all constructions, it seems dubious that there should 
be any construction (something that we grasp completely) which proves 
ad, even if we have convinced ourselves that, for any clearly given d, « 
is indeed a well defined notion. (In set theory: even if x € y makes sense for 
each pair of well defined x and y, why should we be able to put together, 
here and now, all x: x = x, or x: x ¢ x?). 

On the other hand, if we take some particularly simple notion ad, say 
Bd > Bd (where I use > for truth functional implication and, if one wants 
to, 4 and v for truth functional conjunction and disjunction respectively) 
we simply have a proof. Whatever else may be in doubt, we have a perfectly 
clear idea or ‘schema’ for verifying Bd > Bd. The kind of judgement 
involved here plays the same role among proofs as, say, the identity operator 
plays among functions. It is simply a mindless ritual to chant: for each type 
we have a different identity operator. (Though, trivially, for each domain D 
the set of pairs {<x, x>: x € D} depends on D.) 

The obvious and immediate conclusion is: just as there are some opera- 
tions which are defined for arbitrary operations (in the non-trivial sense of 
giving distinct values for ‘lots’ of arguments, e.g. the identity operator, 
the composition operator etc.) so there are some notions « which can be 
proved by constructions to hold for unrestricted d. The definition of other 
operators depends essentially on a given domain (‘essentially’ in the sense 
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that the function is made total by a trick of, say, defining its value to be zero 
outside the given domain; cf. [21], 2.151, pp. 124-125). In the case of 
notions «, the corresponding restriction concerns the variable d. 

Example (cf. [21], p. 126, 2.213). Suppose the variable d does not occur 
in «, and « Df has been formally derived by a derivation p. Now, if this p 
uses only very elementary principles (of propositional substitutions), by 
the above we may infer a > x(a,; d+ Bd) in the notation loc. cit. In words: 

If « then the proof a, (described by p) establishes f for variable d. 

If, however, the formal derivation p itself uses principles such as the 
reflection principle and f refers to properties of the notion of proof itself, 
we should only infer, for a given (grasped) domain D: 

If « then the proof a, establishes de D > fd, for variable d. 

Present development of the theory of constructions is unsymmetric with 
respect to functions and proofs. Several non-trivial total functions are avail- 
able (such as pairing and inverse pairing functions in [21], p. 126, 1. 6, 7, 
superseded by Goodman’s much more thorough [8]'*), but next to nothing 
is done by way of analysis of judgements. The device (cf. [21], p. 126, 1.—12) 
of using the formal derivation p as a name of the proof a, is a purely formal 
trick. 

Grasped domains and the theory of species of natural numbers. Clearly 
what is required is an analysis of the kinds of restrictions D (if any) needed 
for given p, and principles, corresponding to set theoretic existential axioms, 
for stepping from given D to ‘bigger’ ones. This matter is wide open. But 
for orientation (and also for its intrinsic interest) let me consider the 
particular abstract assumptions sufficient to derive the formal laws of the 
theory of species [in § 4(a)]. The present discussion supersedes [21], p. 126, 
2.215 and [25], p. 429. 

We suppose given the graspable species of natural numbers with character- 
istic function Cy, i.e. Cyd=1 ifd is a natural number and Cyd=0 otherwise. 

The assumption is that we can grasp the notion of being a species of 
natural numbers, that is we have a characteristic function Cy, for the 
property (of d): 

d is a construction involved in the notion of species (of natural numbers). 
In words, if we understand the concept of natural number we also under- 
stand what a proof or construction implicit in the concept of natural 
number is (more precisely, normalized, irredundant constructions are 
meant; otherwise it would be absurd to suppose that we have a grasp of their 
totality). Thus, in particular 
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(i). For species c, we have: Cy,c (short for Cy,c = 1); 
(ii). For species c, and (constructions) p and x, 


(Cy, ¢Aepx) D(Cy, pP&Cyx), 


that is, if c is a species and p proves that x is in the species c, then pis involved 
in the notion of species (of natural numbers) and x is a natural number; 

(iii). We have a functor F, associating to c,c,,...,¢,, all satisfying 
Cy, 4 construction d, d = F(e;c,,.-.,¢,), such that 


Gy d —sleeandeen(p. x Cy, <0 cxc, ...¢,) = dp. 


Thus suppose the species defined by the formula 9% with free variables X, 
X; (1 < i< n) is determined by the construction c with parameters x, x;. 
Then VX% defines the species d, when X; has the value c;. 

Comment. The analysis above, like the interpretation of the logical oper- 
ations intended by Brouwer and fomulated by Heyting, uses notions which 
are more abstract than those of familiar constructive mathematics ({21], 
p. 119). The analysis has enough coherence and substance to suggest that 
there is something definite to understand here (even if, by note 3, we have 
not yet done so). But do we want to know about it, not only subjectively, 
but for getting on with the business of constructive mathematics? Not the 
possibility of understanding intuitionistic concepts, but their usefulness is 
the true issue. Dramatic exaggerations would only lead to the kind of let- 
down which Russell felt after he (or rather, according to his autobiography, 
after Whitehead) finished Principia. 

Assuming that the intentions of such an abstract analysis of the principles 
behind the theory of species of natural numbers have been properly formu- 
lated we have 

Problem 9, Is the hypothetical theory of constructions consistent with 
Church’s thesis? 

(Of course, a positive solution of problem 6 does not necessarily ensure 
one for problem 9,'*) 

Finally we have the somewhat related 

Problem 10. Is the set of derivable (variable-free) closed atomic formulae 
in this hypothetical theory of constructions recursive? 

In particular, for any given (closed) term t it should be decidable whether 
the construction intended by t proves a given assertion in the theory of 
species. And, for the formal theory, it should be recursively decidable 
whether t can be formally shown to have this property. 
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It should be noted that the ‘corresponding’ requirement is neither satisfied 
by Kleene’s recursive realizability interpretations [14] and [16] nor by 
Gédel’s functional interpretation [5] of the logical operations. Specifically, 
recall that to each formula % are associated the relations 


the number e¢ realizes 2 
and 
the functional s satisfies Vf2{o(s, ¢) 


resp., where AsVf2o(s, t) is the interpretation of M2 according to [5]. Thus, 
on the interpretations considered, the partial recursive function (with 
number) e and the functional s are the constructions that ‘constitute’ the 
proof of f (together with the judgement that these constructions stand in 
the relation above to 2). But neither of these relations is evidently decidable, 
and the formal derivability of the corresponding formulae (in the systems 
considered ) is certainly not recursively decidable. 

The decidability property mentioned in problem 10 therefore distinguishes 
(what I conceive to be) the intended intuitionistic interpretation of the 
logical operations from some other constructively meaningful ones. The 
property may turn out to be a useful adequacy condition. 

Discussion (continuing the comment of p. 131): The notions here considered 
need not be analyzed before one can do constructive mathematics. The 
principal interest here is philosophical: not to confine oneself to what is 
necessary'* for (current) practice, but to see what is possible by way of 
theoretical analysis. And there are surely many who have an interest in 
the abstract notions, but who, for temperamental or for other reasons, 
had best wait till the theory is technically more developed. After all there 
are a lot of other things to do in constructive mathematics; intentionally 
I chose for my other talk [26] a topic at the opposite end of the scale dealing 
with restricted notions of proof and small ordinals. 


TECHNICAL NOTES 


Notes J and II arise directly from the project described in the main text. 
They derive constructively meaningful results from arguments in the 
literature which were not only prima facie non-constructive but (it is fair 
to say) seemed to have a purely technical interest. In Note II Spector’s 
important paper [37] is used to derive for full classical analysis a result 
whose interest is independent of the notoriously problematic constructive 
validity of bar recursion for all finite types (cf. [25], p. 421). Notes HI and 
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IV concern some notions of choice sequence and their formal theories, 
summarized in the chart at the end of the paper. 


I. Non-enumerability of the species of valid formulae of predicate calculus 
by a recursive function (under Church’s thesis). We combine the ideas of 
[19] and of Mostowski [30]. 

Consider any primitive recursive binary tree T. For a suitable formula 
or (cf. [19]) containing function symbols, the assertion 


all constructive paths through 7 are finite 
is equivalent to 
@r is (contructively) valid. 


The proof of equivalence uses the facts set out in note 7; the result stated 
holds also if ‘finite’ is replaced by ‘not infinite’, and “7, by ‘= — $7’. 
wr has the form 11 > © where 11 is universal and € is existential, and so 
1-7 %r has the form — (11A1,) where 11, is universal and equivalent 
to — &.1 


Mostowski’s general theorem states that the sets of formulae 


{2: Qf is true in all r.e. models} 
and 
{2: QW is false in some primitive recursive model} 


are not arithmetically separable. Inspection shows that, for 9% of the form 
— (UA U,) above, the sets above are not separable by an rue. set, i.e., an 
r.e. set including {%,: Gr is true in all r.e. models}. 

REMARK. The abstract recursion theoretic result is this. The sets of 
primitive recursive trees 


{T: all recursive paths are finite} 
and 
{T: there is an infinite primitive recursive path} 


are effectively not separable by an r.e. set. 

From this classical result we now get the corresponding intuitionistic 
result as follows. Given any proposed r.e. separation, say w,, we find 
explicitly 7, and a classical proof of 


T.€@, and not all recursive paths are finite 
or 
T.¢@, and a particular primitive recursive path p, is infinite. 
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Hence, intuitionistically, it is absurd that 
(i). —1— T, € w, > (all weakly recursive paths of T, are not infinite) 
and 
(ii). = T,€@, > (p, is infinite), 
where {e} is called weakly recursive if Vx = — dyT(e, x, y). Now (i) 
implies 
— — T,€ @, > (all recursive paths of T, are not infinite). 
We now conclude the required result since (ii) implies 
(all recursive paths in 7, are not infinite) > — — T,€ a,. 


For this argument it would not have been sufficient to have a classically 
recursive, that is intuitionistically weakly recursive, path in place of p,. 
So one needs here in an essential way Mostowski’s inseparability result 
in place of the more ‘natural’ result 


{Q: Qf is true in all recursive models} is not arithmetic, 


while in Mostowski’s original formulation, the inseparability property had 
the role of a refinement. 

REMARK. A common misunderstanding is avoided by distinguishing 
between completeness and soundness on the one hand and faithfulness on 
the other. For each % in the language of predicate calculus, let the formula 
%(Q) of a formal theory of species, say , define the validity of UW. Ff is 
called faithful, provided that, for each Y, 


Y is derivable in Heyting’s predicate calculus iff (2) is derivable in SF. 


Despite the impossibility of establishing completeness (in any existing 
system) there are quite elementary formal systems for species or, for that 
matter, for abstract constructions which are faithful. The situation is 
completely parallel in the case of classical logic. Let By(), in the language 
£ of set theory, define classical validity. Now for set theory without the 
axiom of infinity we have 


¥~(X) provable if and only if 9 is derivable in predicate logic, 


but not 
By(H) > (A is derivable in predicate logic). 
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(Take for Q{: every ordered set has a least element.) On the other hand 
soundness is derivable without the axiom of infinity. 


Il. Theory of species: closure under Church’s rule. The present note 
contains sketches of proofs (but for the theory / of species without axioms 
of choice; cf. also the postscript on p. 147). 

For each finite subsystem ¥, of the theory & of species of natural 
numbers, we can formally prove in the latter 

(i). for primitive recursive U(x, y), if Vx — — Iy%(x, y) can be proved 
in ¥, then VxdyW(x, y) can be proved in * (where YW is a canonical 
definition of the primitive recursive relation considered), 

(ii). for arbitrary W(x, y) containing x and y as its only free variables, 
if VxdyU(x, y) can be proved in , then there is a number e such that 
Vxdz[T(e, x, z)A U(x, U(z))] 

can be proved in Y. 

A corollary to (i) is of course, that exactly the same equations can be 
proved to be recursion equations in full classical analysis and in the theory 
of species. (It is well known that this class of equations is stable in the 
sense that it is unchanged by adding the assumption that all sets are con- 
structible. ) 

To prove (i), we use Spector’s paper [37], together with a model of bar 
recursion of finite type in the theory of species (but using only the compre- 
hension principle). For by [37], if Vx — — 4yW(x, y) is proved in the 
theory of species, there is a term t of the theory of bar recursion of finite 


type such that 
W(x, t(x)) 


is derivable in the latter. So all we need is a valuation function in the theory 
of species for terms t(x). (All species needed here are explicitly defined. 
I do not know if [32] can be extended to include axioms of choice.) 

(ii) follows from (i) together with suitable refinement of work presented 
in the lecture by Prawitz [32] (or Scarpellini [33], whose exposition was 
however less clear). It is sufficient to formalize the proof of cut elimination 
(in the sense of Prawitz [32]) for systems Y,. Given a derivation of 
Vxdy(x, y) in &, we get derivations in second order logic of 


SS Oey Ntst0, Vj, m= 0, 12) 0a, 


where 8(s, 0) is the usual first order axiom for zero and successor, and YW, 
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is got from % by letting the variables for individuals range over the least 
species closed under successor. Cut elimination provides, uniformly in 7, 
terms t,, built up from 0 and s (and possibly variables). Since the reflection 
principle for /, can certainly be proved in #, and since we have an 
enumeration of t, primitive recursive in the function which maps proofs 
of S, with cuts into cut free proofs, we get a proof of Vx2(x, {e}(x)) in 
F for suitable e. 

It remains to note that for finite subsystems of second order logic that 
is restricted to comprehension axioms of limited logical complexity, 
Prawitz’ proof can be carried out in classical analysis itself. But the statement 
of cut elimination is of the form VxdyM(x, y) with primitive recursive YU, 
and so we can apply (i). 

It may be observed that a completely analogous argument solves 
questions 1, 2, 3(ii) on p. 350 of [44] positively, and establishes for classical 
analysis the finitist equivalence of cut elimination and 1-consistency of 
analysis; that is, if w maps proofs with cuts into corresponding cut 
free ones, some o maps proofs n of 4x%x for primitive recursive 2% into 
numerical realizations on of x, such that this o is primitive recursive in 
y, and conversely. 


REMARK. There is an amusing parallel between the formally quite different 
work of Prawitz [32] and Scarpellini [33]. Both start with a theorem first 
proved for classical logic and, in the classical context, of very limited use. 
Prawitz takes cut elimination in the sense of Takeuti, in which the most 
important properties of usual cut elimination (such as the subformula 
property) are lacking. Scarpellini takes Gentzen’s second consistency proof 
which, as it stands, merely provides a reduction procedure for derivations 
of purely numerical formulae. (The existence of some reduction procedure 
is an immediate consequence of consistency, and so the interest of Gentzen’s 
result depends sensitively on the particular reduction and the particular 
principles used for establishing the termination of the reduction procedure. ) 
In contrast, when applied to a wide class of formal systems based on the 
rules of Heyting’s predicate calculus, the ideas used in the two ‘unemployed’ 
proofs mentioned above, establish closure under Church’s rule. The rule is 
genuinely problematic. First, even if the theory of species is seen to be 
constructive, Church’s thesis is certainly not evident (cf. problem 6). Second 
since the systems are of course incomplete, even if (for the intended inter- 
pretation) Church’s thesis holds, i.e. formal derivability in SY of VxayU(x, y) 
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implies the validity of deVx4z[T(e, x, z) a U(x, U(z))], there is no guarantee 
that the latter can be derived in - itself (cf. also problem 5). 


III. Scott’s model [36] and the theory of lawless sequences. (The present 
Note is not essential to the main topic of this paper; a mild connection with 
Church’s thesis will come up at the end of this Note.) The model, found 
by Scott some time ago, is naturally interpreted in terms of the theory of 
lawless sequences of natural numbers and constructive functions, more 
precisely, of constructive functions of finite type over the natural numbers 
(that is, §1 of [24] extended conservatively to finite types, as in [5]). The 
interpretation follows the lines of [18] where lawless sequences were first 
used to extract intuitionistically meaningful results from work on the 
topological interpretation. From the present point of view Scott’s [36] 
provides, to date, by far the most advanced development of any theory of 
derivatives of lawless sequences, also called ‘compounds’ ([24], p. 244) or 
‘projections’ (by van Dalen and Troelstra [39]); cf. Note IV for some general 
remarks on such derivatives. We now consider Scott’s model. 

Basic definition. A real number (generator) is a pair (a,a) (c.f. [24], 
p. 244, 4.1) where the functions a considered are obtained by composing 
an arbitrary continuous function (element of K) with a projection into the 
usual spread o of convergent sequences of rationals, and so a is a continuous 
(constructive) function with arguments in N™ and values in o. 

In other words, frezly chosen real numbers are regarded as ‘projections’ 
of some lawless sequence « by means of a. 

With every formula in Scott’s language of real numbers [36] is associated 
a formula of [24]; and with any formula in the language of functions of real 
numbers is associated a formula in the language of [24] but (as mentioned 
above) extended by variables for constructive functions of higher type as 
follows. 

Instead of associating an open set <c N* with a prime formula x < y, 
one takes the proposition, with parameter «, 


JnamVp > m[(a, «)(p)+n7' < (b, «)(p)], e 


where a and b are associated to the variables x and y respectively, (a, a.)(m) 
being the mth (rational) element of the path in the spread o defined by 
(a, x). Note that we use just one parameter « throughout (in contrast to 
the derivates of [24] or the projections of van Dalen and Troelstra [39]). 

It is then a theorem of the theory of lawless sequences that, for given 
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a and b, the species of « satisfying (*) is the open set [x < y] ¢ N%, and, 
for logically compound formulae Y, if 2’ is the formula associated with 1, 
eo Sh 
provided the quantifiers Vx and 3x are replaced by Va and Ja respectively. 

Finally, if 2( is closed, truth of 2[ in Scott’s model is formally equivalent 
to Va’. 

Similarly, for formulae containing variables for functions for real numbers, 
the application operation is defined in the natural way, and the functions, 
that is mappings of continuous functions a (in the basic definition) into such 
a, satisfy the formal axioms of extensionality and comprehension formulated 
by Scott [36]. 

Main result. By the elimination of variables for lawless sequences [24], 
the basic theorems of Scott’s paper (such as Kripke’s schema or Brouwer’s 
continuity theorem for extensional functions defined on a closed interval of 
the continuum) are equivalent to assertions in the language of constructive 
functions (and functionals of lowest type). 

What is still open is, whether the latter assertions are constructively valid; 
Scott promises to analyze this matter. But even as it stands his work 
establishes the consistency of a formal theory of choice sequences including 
Kripke’s schema (cf. the chart in Note IV) for which previously there was 
no wholly convincing proof. The consistency follows from the fact that the 
assertions involved (in the language of constructive functions of finite type) 
are theorems of classical analysis. 

It should of course not be assumed that the proofs in Scott’s paper [36], 
as it stands, are all constructive. Further one would expect that essentially 
different assumptions about constructive functions will be needed to establish 
(the assertions equivalent to) Brouwer’s theorem and Kripke’s schemata 
(in weak or strong form of § 4). The latter is of course of special interest 
here because it conflicts with Church’s thesis. 


REMARK. Scott [36] points out that he had thought of his model some time 
ago but did not pursue it until encouraged by experience with boolean valued 
models (though he does not use any technical results about such models). 
It should be added that another, perhaps more objective, obstacle had been 
overcome in the mean time. Until Myhill’s critique (cf. [23], p. 173, 
elaborated in (31]) the axiom of choice, for continuous f, was taken in 
the form 

VxdyW(x, y) > Sf Vx2(x, fx) 


B VII CHURCH’S THESIS 139 


without requiring uniqueness on y; see e.g. [16] or [21], p. 135, 2.511. 
(Incidentally this was done despite the fact ([21] p. 133 top) that this form 
fails in the model consisting of what I myself called ‘continuous functionals’ 
in [17], cf. also the discussion in Note IV.) The unrestricted form fails in 
Scott’s model [36]. To regard the model as anything beyond a formal 
exercise it was necessary to give an informal analysis of the intended notion 
of choice sequence to show that the form above not only fails in the model, 
but fails for the intended notion. The existence of a continuous f can be 
asserted only if y depends extensionally on x, and one (simple) way of 
ensuring this is to strengthen the hypothesis to Vxd!y%{(x, y), when the 
axiom holds in Scott’s model. (Only in the special case of lawless sequences 
is extensional dependence automatic, cf. [24], p. 238.) 

Correction. Though I myself have not had doubts about the legitimacy of 
the notion of lawless sequences ([18], p. 378, 1. 10-12), as recently as 
January 1964, when I gave the lecture [21] (p. 142, bottom), I questioned 
their usefulness. 


IV. Notions of choice sequence and their formal theories. The purpose of 
this Note is to expand a side remark in § 4(b) by describing (my views on) 
the role of choice sequences in constructive mathematics. 

Contrary to popular first impressions, a little reflection shows that 
we have a reasonably clear notion of choice sequence; more precisely (as so 
often, cf. the notion of set in [27], pp. 171-173) we have a crude mixture of 
notions which have to be separated by distinctions. Further it appears 
(again as so often) that there are relatively few basic ingredients of the 
mixture from which the rest can be defined. Several papers in the present 
Proceedings set out, axiomatically, properties of such basic notions. 
Personally I believe that the notion of choice sequence is of particular 
interest in formulating geometric conceptions within constructive mathe- 
matics ( [24], pp. 246-247, though I have not been able to develop this idea). 
This last remark suggests the following point of view: 

Granted that we understand such notions of choice sequence, we may 
ask whether they are reducible to notions used more widely in constructive 
mathematics. This would be the analogue to reducing the (geometric) 
continuum to set theory, in the sense that (i) it is the only structure satisfying 
Dedekind’s axioms and (ii) there are set theoretically defined structures 
which satisfy these axioms too. In other words we have sufficient properties 
of the continuum to characterize it uniquely up to isomorphism (or, 
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equivalently, characterize it as far as any extensional statements about 
it are concerned). Correspondingly, in the intuitionistic case, as soon as 
the notion of abstract construction was proposed as an analogue (for 
constructive mathematics) to the notion of set in usual set theoretic founda- 
tions, one of the first questions to consider ({20], p. 208, footnote 11) was 
the reduction of choice sequences, since the latter are not constructions. 
Only ‘reduction’ may here have to be understood in a somewhat weaker 
sense (with respect to statements in a specified language only). The so-called 
elimination theorems provide precisely this kind of reduction. (Not sur- 
prisingly they also have formal applications, in establishing the proof 
theoretic strengths of systems for choice sequences.) 

We now list the principal formal systems with a kind of running commen- 
tary. 


Formal relations between theories of choice sequences. 

H,: elementary analysis [12]. 

H’: an isomorphic copy of H, but without (even) the axioms asserting 
the existence of pairing functions or closure under primitive recursion; 
thus H’ is satisfied by lawless sequences. 

H: includes both H, and H’, but also countable axioms of choice 
Vndm, Vnix. 

B: theory of constructive functions and Brouwer operations ({24], § 1); 
(B is sometimes called: IDK.) 

L: theory of lawless sequences, [24], § 2. 


Kinds of choice sequences 

2: closure of choice sequences under mappings of spreads into spreads 
(cf. [21], p. 135, and [24] p. 243, footnote 9) 

I’: closure of choice sequences under arbitrary continuous mappings [38]. 
These two conditions hold respectively when a choice sequence is given 

(i). by a spread 
and 

(ii). as the image of a continuous mapping. 
The corresponding further axioms are 

S: a property of a sequence holds in some spread to which the sequence 
belongs ([21], p. 135). 

C: a property of a sequence holds for the whole range of some continuous 
operation (which range contains the given sequence), [38]. 
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Closure conditions on choice sequences 


V4!: obtained from H, by adding the assertion that there exists a 
continuous ‘choice’ function if 2(a, x) is extensional and Vad!x2(a, x). 

BI!: bar induction with uniqueness in the premise or, equivalently (cf. 
[24], p. 233, bottom), decidability of the set of secured sequences. This last 
condition was mentioned in Brouwer’s statement of the bar theorem [3], 
but not explicitly used in his own arguments. In fairness to Brouwer it 
should be said that, in his statement, he did not use the quantifier combina- 
tion Vadx but spoke of functions (on choice sequences). But the time 
honoured condition on a functional relation 2{ requires uniqueness Yad!x 
and so, as mentioned already, decidability of 2{. See also note 40 of [26]. 

V4: corresponding to V3!, but without uniqueness (and V4, when adjoined 
to S). 

BI: corresponding to BI! without uniqueness. 
Discussion of BI and W4, which (in contrast to BI! and W3!) are not 
immediately evident and in general not true for many of the clearly analyzed 
notions of choice sequence: Historically they were almost certainly 
introduced as the result of an oversight, ignoring (cf. Note III) (i) that 
obviously only extensional functions could be expected to be continuous 
and (ii) that, even if (x, y) is extensional and Vx4y%(x, y) is valid, there 
is not necessarily a continuous f: x + y such that Vx (x, fx). 

Illustration (by analogy): Let xX = ¢X,,X2)>, y = (1, Y2> range over 
pairs of r.e. sets. Let W(X, ¥) express 


X, UX. = Py VU a5 P1 SX, 2 SC X25, NV. = 9G. 


Then, by Rosser’s trick there is a recursive function f from (pairs of ) Gédel 
numbers x of X to Gédel numbers y, but no effective (extensional) operation, 
i.e. no recursive f such that if 


a cee then (ey = (fe peand (C), SK 


Since in general two wrongs don’t make a right, BI and V4 do not become 
interesting by simply adding (heavy formal work) to the original oversight 
that led to their choice. But the systems may have interest for two quite 
independent reasons. Philosophically, since Troelstra [38] has at least 
made plausible that, under certain restrictions on the possibility of com- 
municating rules, BI and V3 may be valid after all. Proof theoretically, 
because the systems considered (and even Kleene’s subsystems [15] excluding 


142 G. KREISEL B VIII 


C) are closed under Church’s rule, and so, whatever their interpretation, 
they may be useful examples in connection with problem 5. 

Correction. In my joint paper with Howard [12] the significant difference 
between WJ and V4! is overlooked. By a fluke the results are not meaningless 
because only numerical existential quantifiers Vfan%(f, 2) occur and the 
distinction between decidable and undecidable is made: by the above this 
happens to be equivalent to the distinction between WJ and W3!. 

All the systems below (except suitable strong forms of Kripke’s schema 
KS) are interpretable in B; in particular all statements in the language 
of B which are derivable in any of the systems are also theorems of B. 


Legend. If a system ¥ is below /’ and joined to * then Y includes /’. 
A broken line indicates mutual inconsistency (inconsistency of S and C, 
by [38], is only known on the basis of certain additional axioms for con- 
structive functions, e.g. Church’s thesis). 


NOTES 


1. More precisely, (1) and (ii) are not exactly parallel. Membership and classical logic 
are kept fixed in (i), and so relativization to L determines unambiguously the meaning 
of formulae in the language of set theory. But in (ii), not only the notion of constructive 
number theoretic function is restricted, but the meaning of the logical particles is altered 
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somewhat arbitrarily. Specifically, even if one considers the identification ‘constructive = 
recursive’, and R(e,«) stands for ‘e realizes «’, it is arbitrary to put R(e’, « > f) = 
Ve[R(e, x) > R({e’}(e), B)], that is to interpret deR(e, «) > de’ R(e’, 8) by de’Ve[Rle, «) 
— R({e’}(e), B)], unless R(e, «) is assumed to be decidable. The value of e’ need not only 
depend on e, but also on the proof of R(e, «). To make the parallel complete one has to 
use an abstract theory of constructions as e.g. in Goodman’s [8] in place of Heyting’s arith- 
metic. Then all prima facie higher type objects are required to be coded by natural num- 
bers, and all functions are taken to be recursive. (This ‘reduction’ to numbers is essential 
when Church’s thesis is regarded as a reducibility axiom.) 


2. Historically the choice of name (e.g. [10], p. 190) is perhaps dubious. Turing’s 
assertion was only that all mechanically realizable functions are recursive. Here we have 
the additional assertion that all instructions defined in terms of constructive notions are 
mechanically realizable. But historically it is equally dubious whether the distinction was 
recognized when Church formulated his thesis. Since in the twenties and early thirties the 
abstract character of intuitionistic concepts was not realized, the concepts finitist and in- 
tuitionistic were often identified (e.g. von Neumann [40], p. 7, 1.19 or Herbrand [9], p. 
210 and the (Somewhat modified) footnote 3 on p. 225), and so probably constructive 
and mechanical, i.e. formal, rules would have been identified too. 

Digression. Besides the two notions of mechanical and constructive rules we have the 
notion of physically realizable (number theoretic) function, that is realizable according to 
current physical theory (cf. [21], p. 144, elaborated in [22], § 4). It seems to be open wheth- 
er there are (finitely specified) physical systems whose most probable behavior is non-re- 
cursive, in the sense that printing the course of values of a (recursive) function, programmed 
ona computer, is the ‘most probable’ behaviour of the latter (regarded as a physical system). 
The theory of partial differential equations gives a negative answer for a general class of 
systems in classical mechanics. This result is not trivial since we are dealing with the 
mechanics of continua and Turing machines are discrete mechanisms. For quantum 
physics our question above seems to be open. It may be remarked (for an earlier discus- 
sion see [29]) that the celebrated three body problem suggests a more specific question. 
Given three neighborhoods U,, U,, U3 in phase space (the mass of particles being includ- 
ed) and an integer n can we always determine recursively neighborhoods V;, V2, V3 with 
the following properties: either V; < U; (=1, 2, 3) and with initial conditions inV;, V2, 
V; there is no collision up to time :—n7~!, or the differences V;— U; (}=1,2,3) are < 
and for some positions in V,, V2, V3 there is a collision before +-n-1? As far as I know 
this problem is open. It would be of interest to examine whether, in case of a negative 
solution, one could use this situation for an analog computation of a non-recursive func- 
tion (by repeating collision experiments sufficiently often). 


3. Though some understanding of this notion and of the history of its analysis certainly 
helps in thinking about the theory of species of, say, natural numbers in § 5. Note however 
this difference between our understanding of abstract sets and abstract constructions: 
while possibly our formal theories of constructions are approaching the standard reached 
in [41], our understanding of their meaning is far short of the analysis of the cumulative 
hierarchy in [42]. 

4. So to speak: ona priori grounds. Granted there are ideas which some people thought 
unambiguous and which on analysis turned out to need distinctions. Why shouldn’t there 
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be other ideas which (perhaps for doctrinaire reasons) people think vague, but which on 
reflection turn out not to be so? If the man in the street is shown examples of convexity, 
they strike him as indicating a vague concept, certainly not as one definable in familiar 
geometric terms. Another (highly relevant) example that is often overlooked (but cf. 
{21], footnote 14 on p. 122) concerns the identity of proofs; in arguments about priority 
mathematicians discuss, in patently objective terms, whether two proofs are essentially 

identical. 

5. There are significant variants according to whether 4x2 can only be inferred from 
M(t) for some term t, or also from Y(t) V 4x%. For instance, in the latter case one can 
use primitive recursive proof figures, in the former not generally. Correction. In [21], 
p- 165, 3.3321 the interest of this difference was not properly considered. 


6. Problem 2 is strictly formal only for the case of extensional equality, because of the 
abstract character of the concept of definitional equality between rules, say [f], denoted 
by terms f (of some formal system). The intended rule will in general involve a specific 
but not necessarily mechanical reduction procedure. Evidently a (syntactic) relation = 
between terms may satisfy given axioms without satisfying 


Vt Vto(ty = ft, ar [tr] = [t2]). 


It seems plausible that some relations = in the literature (cf. e.g. Howard’s lecture[11]) 
do correspond to the intended {(f,, f2): [t:] = [t,]}, though no argument is given. (It 
seems to me perfectly coherent to reject the notion of definitional equality as simply not 
being necessary for existing mathematical practice. But something close to the sin of ul- 
timate despair is involved in both rejecting the notion and going through the labour of 
working out the formal theory of =). 


7. The evident facts used, though not listed explicitly loc. cit., are: 

(i). the intended meaning of completeness, p. 139 (1); 

(ii). any species satisfying the axioms 8 (for the fragment of arithmetic) (pp. 142-143) 
contains a subspecies (constructively) isomorphic to the natural numbers, 

(iii). for species X of natural numbers and constructive number theoretic functions f 


VX[Vn(X(n) vy SX) > If Vn(f(m) = 0 & X(n)]. 


(iv). the principle of dependent choices in the form: for species D (‘D’ for domain) and 
P of one and two arguments respectively, and functions f with numerical arguments 


Vx(Dx > dy[Dy A P(x, y)]) > Vz(Dz > Af Wal[fO = z A DUfn) A PCfn, fnt ))). 


In short, the connection between Church’s thesis and the notion of constructive validity 
(though ipso facto not this notion itself) is insensitive to the exact meaning of the basic 
constructive notions; cf. e.g. [22], p. 254 concerning first order validity in the classical 
case (which requires ‘little’ about the notion of set except of course that infinite sets are 
included). 

8. Let us not forget our theme of comparing Church’s thesis to V = L. Consider the 
converse: what corresponds to Church’s rule in the case of V = L? In general, from a 
proof of IX 2(X) we cannot infer the existence of a constructible X, say, such that (X,); we 
have to restrict 2{ to be invariant (for extensions over the constructibles). Using note 1 on 
p. 142, compare (i) some X, satisfies 21 in L with (ii) the comment to problem 3 on p. 126. 
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9. The main purpose of the consistency results is to help avoid fruitless lines of research, 
since our principal interest is the refutation of Church’s thesis (as formulated here). Note 
that consistency results exclude even a ‘weak’ refutation in the sense of showing the ab- 
surdity of a proof, not only a ‘strong’ one in the sense of exhibiting a counterexample of 
the kind considered in § 1. 

Correction. In contrast to § 1, and despite its title, Kalmar’s [13] contains no trace of 
any uniform, mechanical or non-mechanical, procedure which might conflict with Church’s 
thesis (in any of the senses of note 2, p. 143). Note also that [13] explicitly insists on using 
classical logic; a ‘conflict’ must be established by a counterinstance, and so even the assump- 
tions about the thinking subject which, by Kripke (cf. note 10 below), establish the absur- 
dity of proving Church’s thesis, are not enough for Kalmar’s purpose. It may fairly be 
said that [13] does not provide a framework within which one might even begin to 
refute Church’s thesis. 

10. Since Kripke’s result is (correctly) attributed to him, but his argument does not 
seem to be in print, here is a proof of the result. We only need a specific XY, say Kleene’s 
Vy— T(n, n, y), Xo for short. Kleene shows that X9 is not r.e.. Let us suppose the argument 
uses the principles of classical first order arithmetic. Writing WU(e, m, n) for 


Ag[T(e, m, 9) A U(q) = nl, 


we observe that both dame, m, n) > Xo(n) and 7 Ame, m, n) are (equivalent to) for- 
mulae in the fragment (7, A, ~, V). Now Kleene shows for variable e 


Vim 77 AqT(e, m, g) > 1 Wn([AmU(e, m,n) > Xo(n)] A [7 AmUCe, m,n) > 7 Xo(n))). 


Since Xo (and hence — Xo) are also in the fragment above, we have a proof in intuitionistic 
first order arithmetic of Kleene’s result, and a fortiori one with the stronger premise 
VmdAgqT(e, m, q). 

But this patently contradicts Church’s axiom under the assumption (KS~). 

11. Correction. J shall refer principally to these formulations and not to my earlier 
[20]. It is true that [20] gives a good explicit analysis of the meanings of proposition and 
species (pp. 201-202), correctly stresses the need for total functions (p. 206, 1-17) in a 
genuine analysis of implication, and generally gives the primitive notions (p. 205) in terms 
of which Heyting’s explanation of the logical operations can be built up. But the details of 
the formal language in [20] simply do not correspond to the intentions stated. For exam- 
ple, the use of A-notation and a large number of specific function constants (p. 203, 1.3) 
side by side is absurd since the point of A-notation is to define specific functions. But to do 
this smoothly, the functions have to be partial. Consistency was ensured by a bunch of 
restrictions, which are bad on two counts. Some merely nullify the familiar use of A-no- 
tation (which ought never to have been introduced at all), others (e.g. p. 204, 1. 20) 
which do have a sensible meaning (for instance, p. 204, 1. 20, which applies to notions 
of proof discussed in my other lecture, where only closure conditions on proofs can be 
given since the totality of proofs considered is not assumed to be grasped) are not explain- 
ed. 

There is an additional distinction which has so far not been formally necessary, but 
which is probably important, for example in the explanation of implication (or universal 
quantification). When we think of the pair (p,, p2) 
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P, proves the identity: for variable p, if p proves A then p2(p) proves B, 


p2 is a genuine function or operation, while p; recognizes that p2 satisfies the condition 
stated; thus p; is a judgement. But similarly, since in general both the arguments p and 
the values p2(p) of p2 are such pairs, say p = (p’, p’’) and p2(p) = (p3, py), should the 
function pj depend both on p” and on p’ (or only on p”)? It is perhaps significant that in 
Gédel’s interpretation [5] of 9%{-> ds for purely universal 2 and 8, we have 3s(% — 8), 
that is the function s does not depend on the (hypothetical) proof of 2. (it is not unusual 
that conceptually basic components of a notion are not distinguished in a formal theory; 
cf. the type or rank of a set in set theory.) 


12. It is trivially simple minded to transfer thoughtlessly set theoretic notions to construc- 
tive mathematics. But it is equally simple minded to dismiss thoughtful attempts in this 
direction as ‘mere analogies’ for the following theoretical reason: Most mathematicians 
do not have a clear conception of the nature of the objects they are talking about. So if 
some notion or distinction has a very wide immediate appeal, the chances are that it has 
a meaning for different conceptions of mathematics. Evidently two provisoes follow: (i) 
the transfer can be expected to be valid, say in the case of a familiar set theoretic notion, 
only if the latter is defined in the natural (not in some other, classically, equivalent) way 
(ii) the notion presents itself naively in mathematics, and does not have an explicit foun- 
dational purpose. 


13. It should be noted that Goodman’s axioms are valid for the very abstract notions 
of proof and construction here considered, although his own intended notion (‘semantics’) 
concerns particularly elementary and explicitly described manipulations. 


14. By now the reader will be able to pursue our theme for himself Gif he wishes to do 
so), that is to state the analogue in set theory for V = L corresponding to problems on 
Church’s thesis. Actually, several passages in [21] have been intentionally set out to have an 
obvious analogue for the theme considered, for example, pp. 105-106, 1.52, 1.53, 1.533. 


15. To avoid a very common misunderstanding a distinction has to be made which 
in turn is very much in need of precise analysis. What is ‘necessary’ for practice? 

(a). There is clearly a certain combinatorial element in mathematical reasoning, which 
is intuitively best reflected by stating implications 2{ > $8 using only the most elementary 
logical principles even if 2{ can be seen to be valid. As people put it, all the ‘hard work’ or 
(when we eliminate logical symbols altogether in favour of functions; cf. Bishop’s lecture 
(2]) the numerical content may reside in the construction leading from 1 to 8; the nature 
of the verification of 2 is often quite different. As far as classical] mathematics is concerned 
instead of taking ZF as ‘framework’, one could stay within predicate logic, and as far as 
constructive mathematics goes, some quantifier-free theory like Gédel’s T [5] (perhaps 
even without induction) is an immediate candidate. Naturally if one takes the (psycholo- 
gical) analysis of ‘hard work’ seriously the choice of framework is the central issue because 
it expresses one’s view of the nature of the ‘hard’ part of mathematical activity. 

(b). Besides the question of discovering unconditional assertions % there is also the 
problem of making the choice of notions intelligible. It is one thing to point out (correctly), 
as Bishop does, that Brouwer’s assertion concerning the continuity of (extensional) func- 
tions does not really affect mathematical practice, in sense (a), if we simply take our func- 
tions as given together with a modulus of continuity. But it is a separate matter to explain 
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this step by showing that any definition satisfying some abstract condition is bound to 
provide the additional information; in other words by analyzing (when possible) the most 
general notion of construction, not merely definitions in some formal system such as 
Gédel’s T [5]. Abstract intuitionistic mathematics is concerned with this kind of problem, 
which of course is not touched in (a) above. : 

16. As a result of correspondence with Professor Scott it seems desirable to add some 
comments. 

(a). Our results hold for both interpretations (i) and (i’) of validity on p. 139 of [19]; 
the crucial direction in the equivalence is the derivation of ‘47 %r is valid’. 

(b). Since the domain of the variables (in %r) is arbitrary and not a ‘detachable’ sub- 
species, the desired result, even in case (i’), does not follow directly from early work on 
consistent formulae with no recursive models (e.g. in footnote 5 on p. 384 of [18]), be- 
cause the domain of these models is assumed to be recursive. 

(c). The formula 11 above corresponds in an obvious way to the formula (7) on p. 144 
of [19]. The delicate case to consider occurs if the species depend on freely chosen objects, 
say p, aS parameters and the elements of D* are such objects. Q* corresponds to a path 
through 7. We have, of course, a constructive sequence of operations a3, a%,.. with 
a‘p, a; p,... the ‘natural numbers’ of D*. 

Evidently some properties of (operations on) freely chosen objects have to be used in 
our deduction. What we need essentially is that Vp —(11* A 11*) should follow from the cor- 
Tesponding result with p replaced by constructive objects, such as the theorem (for the 
choice sequences treated in [21], p. 136, line 1 or 2, .524 or [38]): (Wa-1 Aa) + (Wa Ax) 
for constructive number theoretic functions a and choice sequences «. The reader who has 
another notion of choice sequence (and hence of validity) in mind should concentrate on 
the conjunct 

VnAlm([m = 0A D*(azp)] V Im 40 A O*(G5 py) 
corresponding to (x)[D.(x) vm D(x)] in (7) on p. 144 of [19]. If, for each n, m depends only 
on a neighborhood of p, the possible paths corresponding to different p can be construc- 
tively enumerated. Hence the assumption that all constructive paths through T are not 
infinite gives the required conclusion. 


POSTSCRIPT 


(to problem 6 and Technical Note II) 


The consistency of Church’s schema has now been verified by extending 
Kleene’s original recursive realizability interpretation to the theory of species 
((43], 3.7.3; further work needed for conservative extension results is ana- 
lyzed in [43], 3.8. Closure under Church’s rule can probably be obtained 
by modifying [43], 3.7.3 using Kleene’s (+ realizability) [15], and thus im- 
proving Technical Note II). As a result of correspondence with Dr. Troel- 
stra I can now formulate my original doubts about the consistency. 
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We know of course that the addition of axioms for species to an elementary 
theory of constructive functions is non-conservative (with respect to the 
language of the latter); new sets of equations can be proved to define 
(total) functions. To show that species have ‘nothing to do’ with functions, 
we should have to have a set F of evident axioms for functions, such that 
the addition of the usual axioms for species is conservative. But the nearest 
approximation to such an F is Kripke’s schema in § 4(c) which implies of 
course that every species is constructively enumerable. But F is inconsistent 
with Church’s schema. 
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FORMAL SYSTEMS OF INTUITIONISTIC ANALYSIS II: 
THE THEORY OF SPECIES 


JOHN MYHILL 


1. Background. In the first paper [4] of this series I proposed an axiom system 
for the ‘elementary’ part of intuitionistic analysis, i.e. the theory of natural 
numbers, computable functions and free-choice sequences. The treatment of 
species in that paper was very sketchy (I merely adjoined comprehension 
axioms, giving two forms of the system, one predicative and one impredica- 
tive). In this second paper I consider the question of axiomatizing higher- 
type objects more fully. I believe that what I have done is adequate to the 
uses of higher types in the actual writings of Brouwer. 

Unfortunately, it does not seem that a theory suitable for this purpose 
can be obtained simply by adjoining new axioms to my old system. The rea- 
son for this is that the old system was designed to include ‘Kripke’s schema’, 
with a view to formalizing Brouwer’s so-called historical arguments which 
are being discussed by Professors Troelstra, De Iongh and Van Rootselaar 
and Mr. Hull at this conference. Let us recall: the schema states that for any 
proposition 2, possibly containing parameters, we can form a sequence 
T such that ty(n) = 1 or O according as 2 has or has not been proved by 
the nth stage. Thus 


Ta(n) = 1 > U (1) 
and 


(Vn)(ta(n) = 0) o — W. (2) 


This schema (explicitly stated by Brouwer, and even with + instead of > 
in (1)) is used by him e.g. to prove that —(Vx)(a(x) = 0) does not imply 
(Ax)(a(x) 4 0). (For this, consider « = ta, 4 with an undecided Y: then by 
(2) =(Vx)(a(x) = 0), but no n for which a(n) 4 0 is known.) 
In my old system I asserted Kripke’s schema in the form 
151 
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Bray [Ce teloa) = Ove) = 


A 
(Sx)(a(x) = 1) > W 
A 
(Vx)(a(x) = Io =H 

i.e. we asserted that the ty of (1)-(2) was a free-choice sequence. This got 
us into trouble with (V«)(3f)-continuity, i.e. the assertion that if (V«)(3f) 
%(a, 8) then the f depends continously on the «. (Here and henceforth 
a, B, y denote free-choice sequences). To see why this is so, let 2 be “(Vx) 
(B(x) = 0) with a free-choice parameter B; then — (8 = 0) (@ = 0) 
and a(x) = 1 > f = 0, so that « cannot possibly be a continuous function 
of B. This point (the contradiction between Kripke’s schema and (Va)(3f)- 
continuity) was I believe first made by me in my ‘Logique et Analyse’ article, 
and has been explored at length by Troelstra in one of his lectures. 

A makeshift repair was made by me in my Amsterdam paper last year (the 
first paper of the present series), by substituting the (V«)(47)-continuity 
axiom for the (V«)(4f) one. The resulting system, including Kripke’s schema, 
has resisted any attempts to find a contradiction in it and not much analysis 
appears to be sacrificed. For it is proved in the book of Kleene and Vesley 
that (V«)(3n)-continuity implies (V«)(3!8)-continuity (4!8 = there exists 
an (extensionally) unique f) and it is the latter that is almost always needed 
in analysis. Further the B = tyy,)(ax)=0) 18 conspicuously non-unique, at 
least on the intersubjective version of intuitionism; different mathematical 
subjects are evidently free to prove things in different orders. 

However, this repair ((Va)(3n)-continuity replacing (V«)(3f)-conti- 
nuity) is not based on a careful philosophical analysis of what it is to be 
‘given’ a free-choice sequence; one finds this out the hard way when one 
realizes that on a most evident assumption about functionals, the two forms 
are equivalent. 

More precisely: the axiom of choice says 


(Va)(SA)U(a, B) > (4®)(Va) A(z, H(a)), 
at least for 2 with only lawlike parameters. Kripke’s schema says 


(Va)(36) A(¢ =0)ef=0 


(3x)(B(x) = 1) > (Vx)(a(x) = 0) 
for short (Va)(4f)B(a, f). 
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By the axiom of choice there is a function ® for which always 
B(x, P(x), 


in fact (x) is what we called before tyy,»(a(x)=0)- Hence (Va)(3!2)(B = 
= («)); and any such # depends discontinuously on «. Thus (Va)(3!8)- 
continuity is violated and consequently (V«)(4m)-continuity also. Briefly, we 
cannot have in the same system Kripke’s schema, the axiom of choice and 
(even weak) continuity in its unrestricted form. The restriction needed is 
simply that if (V«)(4n)WU(a, ) and if U is extensional in a then nis a continuous 
function of «. Indeed the classical arguments for continuity make no sense 
unless only the values of « are used in computing w. From this we can infer 
(Va)(3!8)-continuity for extensional contexts. Now (Va)(4!8)(B= tyysya(x)=0))s 
but in computing f we use knowledge about « which does not consist simply 
of its values. So the restriction on continuity is not simply an ad hoc one to 
avoid contradiction, but one which is evident once we analyse the meaning 
of the prefix (V«)(3!8). We formalize the requirement that 2 be extensional 
in « by 
Crt, (x, n) <> (Vapn)( = PAs, n) + U(B, n)) 

but here a doubt arises which is not completely resolved. We want €rt,, 
M(x, n) to mean that m depends only on the values of «, not on any other 
(intensional) information. The condition « = BAY(«,n) > WB, n) is a 
necessary one for extensionality in this sense (if « and 8 have the same values 
and (a, 7) and n depends only on the values, then 2(f, 7)); the implication 
in the opposite direction is not absolutely clear though very plausible, and 
we shall hopefully undertake its defense (on philosophical grounds) in a 
later paper of this series. We are indebted to Professor De Jongh for point- 
ing out to us the necessity for such a justification. 

There are a few other additions to be made to the original system. Since 
the functionals ® are now not necessarily extensional (e.g. if a = tyyxyacx) = 0) 
then « = B does not imply Sa = Of) we must include axioms for intensional 
identity = as well as extensional identity =. There is also a problem about 
‘lawlike’. The axiom of choice (in all types) should read someting like 


(Vz)(3y)2U(z, 9) > (36)(Ve)UE, Sr) 
where @® is lawlike if 2{ has only lawlike parameters. However this yields 
(36)(Va) (Ax)(x)(x) = 1 > (Vx)(ax = 0) | 
(Vx)(@a)(x) = 0+ —(Vx)(ax = 0) 
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with a lawlike ®, which is certainly not true if by ‘lawlike’ one means 
‘mathematical’. An ingenious counterexample suggested by Kreisel in his 
concluding address at this conference indicates that even in the case that 
¢ and ty in the axiom of choice are number-variables and only mathematical 
notions occur in Y, still the & may be only ‘empirically’ and not mathema- 
tically lawlike. (This example will be discussed later in the present paper.) 
Hence we need both the notion D of ‘lawlike’ used in [4] and a stronger 
notion M of ‘mathematically lawlike’ (= Troelstra’s ‘absolutely lawlike’ 
in [6]): the formal treatment of this distinction is the most curious feature 
of the system proposed here. 


2. Formal description of the system. Signatures and /evels are defined by 
simultaneous recursion as follows (cf. Schiitte [5] pp. 245 seq.). N (the type 
of the natural numbers) is a signature and its level is 0. If > 1, 0,,...,6, 
are Signatures, and m is greater than the level of each of o,,...,0,, then 
m(o,,...,6,) is a signature and its level is m (the type of sets of n-tuples 
{¥1,--+>E,> with r,;¢0; for i= 1,..., defined by formulas quanti- 
fying only over levels < m). If n > 1 and o,,..., 6,, t are signatures, then 
(o,,...,6,)t is a signature and its level is greater by 1 than the greatest of 
the levels of o,,...,0,, 7 (the type of functionals having arguments of 
types o,,..., 0, respectively and taking values in type t). If t is a signature 
so is [t] (the type of finite sequences of objects of type t). 

If we write (o,,...,0,) aS a signature we understand it to be m+1 
(¢1,-.-,6,), where m is the highest of the levels of ajc. 

The type of a term is defined recursively as follows. A variable with sig- 
nature (superscript) o is of type ; so is a defined constant with signature o 
introduced under RD below. If t,,..., t, are of types o,,..., 0, respecti- 
vely and 3 is of type (a,,..., o,)t, thenle(tje ets eC) type tli 1 isten 
type [t] and 8 is of type N, then t[3] is of type t. 

0 is of type N (zero). 

s is of type (N)N (successor). 

D® and M™ are of type (t). (Here and in some other cases the super- 
script will be omitted if no confusion can arise. D“ is the species of lawlike 
object of type t and M“ is the corresponding species of mathematical (ab- 
solutely lawlike) objects.) 

[ !" is of type [t] (empty sequence of objects of type t). =(o,, o;) 
is of type (o,, o,) (intensional identity). 

If t is of type [t] and 8 of type t, then t*[8] (the finite sequence obtained 
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by adjoining $ to the sequence t) is of type [tr]. 

KOON) is of type (((N)N)zt) (the species of continuous mappings of 
(N)N into type t). 

Certain primitive recursive functionals of specified type also appear in the 
axioms. 

Conventions. An italic variable without superscript will be used for type 
N. Upper case italic variables will be used for species (sometimes the sig- 
nature will be omitted). Large Greek letters will be used for functionals, i.e. 
variables of type (o,,..., 0,,)t, except for N(N) for which we use «, f,... 
Small German letters denote variables or terms of unspecified type; large 
German letters denote formulas (possibly with free variables). 

Formulas are built up from atomic ones of the form 3(t,,..., t,) where 
CMseCt ypu ems Glands 4 are of types 7, ---, c, respectively, 
by connectives and quantifiers of all signatures. [We write t, =t, for = 
(t,, t,) and (sometimes) ti € t$ for t,(t,).] In addition if is any formula 
and ta term of type N, then }, 2 plays the role of an atomic formula. 

The underlying logic is an infinitely many-sorted intuitionistic predicate 
calculus. 

In addition to the usual logical rules, we have: 

RD. If (3x")%€(x) is a theorem (2 without free variables other than x), 
one can introduce a term t of type t by the defining postulate I(t). (Notice 
we do not need to prove uniqueness.) We call the proof of (4x‘)%(x) the 
justification of t. 

The non-logical axioms fall into several groups: 

P. Peano axioms with variables of type N, with = for the identity re- 
lation (sometimes we write = instead in this case) and with induction as a 
schema. 

K. The inductive definition of K“%™”: 


Conzi(VNS = KG, 
(¥n)K(®(n)) > K(Z®), 

(Vb)(Const S > A(S)) A(WWVP)((Wu)U(Y(n)) > WEV)) > (VS)(KS > US), 
where we define Gon3t and = (under RD above) by the defining postulates 
Const dB <> (Vxz')(P(z) = P(x’), 

(Z)(x) = (H((0)))(p«), 
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where 
(px)n = a(n +1). 


Note that our continuous functionals are not necessarily lawlike. 
I. Axioms for (intensional) identity and finite sequences: 


oe 
Wx)ar=y > A(y), 
r=yv a(c=y), 


sail) seas) 
xPVS y <o IPN) Sly) a(¥z < 1(y))\(x(z) = y(z)); 
[{"DN(x) is the length of the finite sequence x!" and we postulate the de- 
fining equations: 
iL ]=0, 
Ix#[y] = sl(x), 
Ul Ja (vx™)(Vy M(x) > U(x * Ly]) > (vx!) M(x), 
n<ix) + x * [y](n)=x(n), x * Lyx) =y, 
n> I(x) > x(n) =0 
(where 0° is an appropriately chosen zero-object in each type). 
(ee = 1a 
(6, s(x) = ~(@x) + [0(x)] 
(course-of-values functional; henceforth we write (x) instead of ~ (4, x).) 
D. Axioms for D and M: 


Dt,a...ADt, > Dt —_(n 2 0), 


where Y;,..-.,21,-+-, 2, are all the"tree vamables ont exceprthe mumber 
variables. (Thus a species or functional is lawlike if only lawlike parameters 
enter into its definition.) Likewise 


Mzr,A...AMt, > Mt  (n2 0), 


where £,,..., 2, are all the free variables of t except the number variables, 
and where the sign} is not used in the justification of any defined constant 
occuring in t. 

Drv — Dt, 

Mrv —Mr 

Mr > Dr. 
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Thus the lawlike objects of any type are a detachable subspecies of that type, 
and the mathematical objects are a detachable subspecies of the lawlike one. 
F. Axioms of continuity: 


(Va)(Ax)(Dx A Ua, x)) A Ext, U(x, x) > (AD € K)[ (Vor) W(a, Sax) A(Dr, 
A...ADt, > D®)] 


where £,,..., 2, are all the free variables of 2 except « and x. (One might be 
tempted to postulate the same thing with M replacing D: but the apparent 
invalidity of the same substitution in the axiom of choice, discussed below 
in section 3, makes us hesitant to do this.) 

S. Axioms for species (comprehension): 


(ereaeU (Vx! re (Vx Sex g cee oe) 
> W(x,,...,X,))A(D), A ...ADY, > DS)] 


where )),,..., 9), are all the parameters of 9{. Thus a species is lawlike if no 
non-lawlike parameters occur in its definition. The same clearly holds 
reading ‘mathematical’ for ‘lawlike’, and we postulate 


(Se 2 Wx, prey Xq)(S(X4 ee) Xn) ee U(x, eps ied Xn)) (MY, 
A...AMy, > MS)] 


where ),,..., ), are as above and where % continues neither | nor any con- 
stant in whose justification + appears. 

Notice that with RD these axioms give us the effect of an abstraction ope- 
rator. 

In-the above axioms m is to be any number greater than the levels of 
0,,...,6, or any variable occurring bound in 2, and greater than or equal 
to the level of any constant or free variable occurring in 2. (For this purpose 
+ is regarded as having level 0). This gives the ordinary (finite) ramified hier- 
archy (Schiitte, loc. cit.). 

C. Axioms of choice. 

AC. 


(Ver) -- - (VEn)(S0) (Ei «+ Bae W) > 


(3®)[((Vz,) Caney (Vz, ) U(r, rela P(E, eer] En))A 
(Dy, a...A Dy, > D®)], 


wherest)y,cee~ atesall the free variables of 9 other than 7,,...,1,, 9 
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and number variables (notice there is no restriction on )); notice also that this 
with RD obviates the need for A-terms). 
DC (Dependent choice). 
UL Pa (vx! )(Ux > (Ay) * Ly) > 
(36) ((Vn)UH(n)) A (Dy, A... A Dy, > D&)] 
where ),,.-., , are all the free variables of 2{(x) except (number variables 
and) x itself. 
With RD this allows us to make definitions by primitive recursion in all 
types: hence the system contains all the Dialectica functionals. 
J. Axioms for the thinking subject: 
Fen A Vel Pea Y, 
((Am) +, 2) <> U 
+, An > mot, U. 
Finally we assert the ‘Never-on-Sunday schema’, which asserts roughly that 
unless we have already proved %2v — 2, we can say nothing mathematical 


about the stage at which Y will be decided. 
NOS: Grt(S) AMS at, (ta € S) > 


F, vt, A Wv (Va) (Vx)(a(x) < 1) 3aeS 
A 
(Wxy\a(x) = 1Ay > x a(y) = 1) 


(Wx < n)(a(x) = 0) 


Here ty is introduced by RD using the axioms for thinking subject and 
satisfies 


Ty(x) < 1, 
T(x) = 1 > (t(x+y) = 1), 
UW <> (4x)(ta(x) = 1). 
NOS yields as consequences to both Troelstra’s 
Maa-t,%aat, AX > A+, (ty = 2), 


(where we do not see the necessity that 2{ contains only lawlike parameters). 
and the strengthened form of Kripke’s schema discussed by Hull in [1]. 
We presume that much more can be said about +,, M and their relation- 
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ships; for example, there seems no way of proving from what is said here 
that — (Vy)(Dz > Mx). (There is an obvious intuitive argument.) But 
this is a subject for another paper. 


3. Concluding remarks. These remarks concern the particular forms of the 
axiom of choice (AC and DC) that we used. In both of them we have been 
more cautious than would appear on the face of it to be warranted: we wish 
to justify this caution. 

Consider first a simple case of AC 


(Vx)(Sy)U(x, y) > (AG)(Vx)[U(x, x) A DE], (1) 


where % contains only mathematical parameters and constants. Should we 
not in this case be allowed to infer M® rather than D@? (Notice that 
by an earlier observation in section | of this paper, we cannot do this if 
t and 4 are replaced by free-choice variables.) 

Let us consider the usual intuitionistic argument for (1). The hypothesis 
is proved by exhibiting a method for obtaining the y from the x; ®(x) is 
simply the y obtained by that method. There are it seems several possible 
situations. (I) The method may not use any information about any tT» at 
all. In that case there is no trouble with (1) and @ is absolutely lawlike. (II) 
It may use general properties of the tg (e.g. Kripke’s schema itself or NOS) 
but still no particular values of any ty. This still seems harmless to me; in 
particular if Church’s thesis were refutable on that basis I would think 
Church’s thesis was simply wrong, i.e. that there are uniform methods which 
are constructive from the point of view of intuitionism and yet non-recursive. 
(III) Particular values of some t,, are used in computing y from x, and yet 
they always lead to the same y. If that could be proved, i.e. if (Vr)(4!y) 
M(x, y), it seems to me that this reduces to II since the particular values of 
T are not used in any essential way, but only its general properties. This 
case of (1) (the (Vx)(S!y)-axiom of choice) seems much more justifiable, 
but of course not as useful as the general form since it only applies to decid- 
able 2[. (IV) One or more tg’s appear in 2. Then y can certainly be expect- 
ed to depend on x ina non-(absolutely) lawlike manner, but that need cause 
us no concern: the restriction on the D-axioms (that no defined constant can 
be asserted to belong to D if Kripke’s schema was used in proving the exis- 
tence theorem for that constant) prevents the application of (1) since the 
T 3S count as non-lawlike. (V) There may be no t,’s in YW, nor any notion 
defined in terms of them, and nevertheless the particular values of certain 
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Ty's could make a difference to the value of the y obtained by the ‘method’. 
It seems highly implausible to me that such a thing could ever happen and 
yet I cannot prove that it is impossible. Quite probably a detailed philo- 
sophical argument could rule it out; but Troelstra has been very scrupulous 
and refrained from asserting (1) until such an argument is forthcoming. 

In his paper [3] at this Conference, Kreisel has given a strong argu- 
ment against (1), not indeed for 2's written in the notation of this system, 
but for another 2% which we might quite reasonably wish to add. This 
M(x, y) means: either x is not the Gédel number of a proof (in some 
particular intuitionistic system) of an existential statement (3z)8(z), and y 
is 0; or x is the Gédel number of such a proof and y is the z yielded by the 
proof. Then the hypothesis of (1) is true, and there is certainly a method (name- 
ly examination of the proof x) to find the y; but one cannot in general 
expect a recursive method. For particular intuitionistic systems such methods 
are sometimes known (cf. Kleene [2]) but there seems no justification at all 
for asserting that given an intuitionistic system (e.g. the present one) we can 
find (from the proof that it is an intuitionistic system) a recursive ® for this 
particular 2[. Hence we have no reason to assert the conjunction of the two 
following propositions: 

For this particular 21, there exists a ® which is mathematical (absolutely 


lawlike) (2) 
and 
Every mathematical ®e€(N)N is recursive (Church’s thesis). (3) 


In our system, we assert neither (2) nor (3). But when we reflect on the 
meaning of ‘lawlike’ and ‘mathematical’ Kreisel’s argument seems to throw 
doubt on (2) more than on (3). There certainly does exist a © for (1); we 
can tell someone ‘analyse the proof x and find the z yielded by it’. This is a 
lawlike method (certainly it does not depend on any free choices) but it is 
not a mechanical method (it demands understanding and reflection, not 
only following rules). Nevertheless we could reasonably maintain that the 
predicate 9{(x, y) isa mathematical one rather than an empirical one: hence 
we cannot in general strengthen D to M in (1). At least no argument has 
been given why we should. 

Our last remark concerns the axiom of dependent choice. One might be 
inclined to postulate something simpler namely 


(VreS)(3yeT)U(z, 9) + (26 (S)T)(VeES)U(z, H(z). (4) 


That certainly yields DC as a consequence, and at first sight it looks plausi- 
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ble. Why not accept it instead of DC? To avoid the notational complica- 
tions involved in partially defined functionals ®e(S)T, let us consider a 
weaker form of (4) (but still enough to give (DC). If we succeed in making 
the weaker form doubtful, we shall have no reason to assert the stronger one 
either: 


(Vee S)(Ay)U(z, 9) > (AT)[(Vxe S)(3!y)T(E, 9) a (VEE S)(VOTEE, 9) > 
Uz») (4) 


(For decidable S, this is a consequence of AC). 

The reason one is inclined to believe (4) or (4’) is that at first sight the 
usual intuitionistic justification of AC seems to go over unchanged. But let 
us recall that justification, and see where the difference comes about. 

If (Vr)(39)WU(z, 9), then by the intuitionistic interpretations of 3 there 
is a method for finding the ) from the r. Then @(r) in AC is simply the 9 ob- 
tained by the method. Likewise in (4’) we are inclined at first to say: T is 
the species of those pairs (r, )) where r € S and where 4) is obtained from ¢ 
and the proof of r € S by the method which we must have in order to assert 
(Vzr)(c eS > (Ay)A(z, h)). But if we analyse this in terms of the theory of 
constructions, we see that we have made an error. 

Let us recall some principles of that theory: 


nm proves (Vr)%(r) iff 72, proves that (Vr)[2(x) proves WU(x)], 
nm proves +8 iff 7, proves that (Vp)(p proves 2 > 2,(p) proves B), 
nm proves (3r)U(r) iff 2, proves U(z,), 
Applying these principles to (4’) we get 


m proves (Vr)(z eS > (3h) 2(z, 9) 


c= 


m, proves (Vz)(m,(z) proves (re S > (ay)U(z, 9))) 
-_ 
m, proves (Vr)((z2(z)), proves (Vp)(p proves re S > (22(r))2(p) proves 
(31 )U(z, 9))) 
— 
(Vzp)(p proves re S > Uz, ((z2(Z))2(P))2): 
i.e. the 9 in (4’) depends not only on the ¢ but on the proof that re S: 
different proofs can be expected to yield different )’s. (This is particularly 
clear if S is defined by an existential condition). 
On the other hand, the theory of constructions yields a straightforward 
justification of DC itself. 
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Assume 
21 lee (5) 


then something, say x*, proves it. Assume also 


(Wx!*)(Qx + (Ay) U(x * Ly])), (6) 
then the antecedent of (4’) yields functions @ and yw such that for all x7 


m proves x > (x, x) proves U(x*[Y(z, x)]). @ 
So 


n* proves AT J, 


p(x*, [ ]) proves U[y(x*, [ ])I, 
p(e(x*,[ 1), ¥(x*, [ ])) proves Uw(x*, [ DF (o(e*, [ 1). ¥@*, [ 1), 


and in general if we define by simultaneous recursion 
Gi) =e 
Yn = W(ta» Los ++ +> Yn-1]), 
Tr+1 = P(t, [Yo,-- +s Ya-r)), 


we obtain from (7) that always 


i, proves U[) oe ore ab 
and DC is satisfied with ® taken as (Ax)b,. 
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PROJECTIONS OF LAWLESS SEQUENCES 


D. VAN DALEN anbD A. S. TROELSTRA 


Introduction. Intuitionistically, one may conceive a wide variety of notions 
of choice sequence (see e.g. [2], [3], [4] § 10, [5]). 

Many of these notions are of little use to the construction of a theory 
of real numbers because of their ‘anti-social behaviour’ (described in [5]). 
Nevertheless, they provide us with a nice testing ground for intuitionistic 
concepts and principles, and also they represent interesting examples of 
topological spaces. 

In this paper we restrict ourselves to so-called anti-social notions of 
choice sequence; they are supposed to be described within the general 
framework outlined in [5]. Sections 1 and 2 summarize notions, notations 
and principles from [5] which are needed in this paper. For intuitive ex- 
planations we must refer to [5]. 

The ‘anti-social’ behaviour of the notions considered is due to the fact 
that the so-called restricting conditions (see § 2) do not contain non-lawlike 
parameters in these cases. This type of behaviour is best illustrated by a 
result (given in [5]) which states that for a wide variety of anti-social notions 
the concept is not closed under simple continuous operations (like the 
operator I’, defined by Ay. [Ax.2yx] where x is a variable for numerical- 
valued sequences). More precisely, simple relations like 2x = € are possible 
only when x, € are both lawlike. 

Our aim is to ‘approximate’ some of these notions by sequences which 
are obtained by certain lawlike transformations (called projection operators) 
on lawless sequences (of lawlike objects). The resulting sequences are called 
projected sequences. 

With each projection operator the question presents itself: to what extent 
do the projected sequences approximate the intended notion of choice 
sequence? The matter is subtle (see § 4) but in some respects we obtain 
satisfactory approximations. 

For example, in § 6 we obtain a proof of (\.«\V!x-continuity and A\«V!a 
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continuity for projected sequences which ‘approximate’ notions of choice 
sequence with these properties. The results concerning anti-social behaviour 
are also easily translated for projected sequences. 

It turns out however, that the validity of /(\«\ x-continuity may depend 
on the projection chosen for the approximation. This is illustrated by an 
example given in § 4, where for two projections, both approximating the 
same notion, /\«\ x-continuity holds in the first case, but is refutable in 
the second case. From the results in the final section however, it follows that 
/\«\ x-continuity also holds in the second case with respect to predicates 
in a suitably restricted language. Further research in this direction seems 
to be promising. 

The motivation behind the study of projections is two-fold. In the first 
place, lawless sequences are relatively simple, so we may ask for approxima- 
tions (which in some cases are almost as good as conceptual reductions, 
as we shall see) of the more complicated notions by the conceptually 
simpler lawless sequences. In this respect, the elimination of abstr (the 
abstraction-process introduced in [4], [5]) from the derivation of A «V!x- 
continuity for projected sequences is worth noting (§ 6). 

In the second place, because the projected sequences are approximations 
of certain notions, they may serve the same mathematical purposes as these 
notions. Thus we reduce the number of notions required in intuitionistic 
mathematics. 


§1. Preliminaries. In this section we introduce some notations and conven- 
tions. For intuitive explanations we refer to [5]. 


1.1. We use boldface capitals A, B, C, D, ... for constructive classes 
(species) of constructive (lawlike) objects. We shall assume these classes 
to be decidable (in a universal sense), i.e. for a given construction it is 
decidable whether it is a construction of the given species or not. N is 
reserved for the class of natural numbers. For elements of N we use the 
lower case letters k, n, m, x, y, Zz, u, v, w (with sub- or superscripts if 
necessary). 

We suppose these classes C,D,... to be provided with a decidable 
definitional (or intensional) equality, usually denoted by = (or more 
precisely by =¢, =p etc.). We write = for =y. 


1.2. Cartesian products are defined as usual (notations: Cx D, C"). A 
mapping from C into D is any kind of (not necessarily predetermined) 
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method of assigning an element of D to any element of C. A mapping ~ 
from C into D is said to be of type (C )D, or expressed otherwise: g € (C )D. 

We use the lower case letters a, b, c, d, e, f (with or without sub- or 
superscripts) as variables for lawlike elements of (N )N. These functions are 
thought of as being given by a prescription with a proof that a natural 
number is assigned to any given natural number. a = b means that pre- 
scriptions and proofs for a and 5B are identical. 


1.3. Sequences are mappings of type (N)C for a suitable C. In discussing 


mappings of type (N)C we use €, y, vy’, y”,... as variables for mappings 
of this type. 
A finite (ordered) sequence of elements po,...,p, of C is written as 


€Po>+++sPndi BP =per<P>- The equality =~ may be extended to finite 
sequences of elements of C (i.e. to (), C”) by stipulating: 


<Po> #49 De = <Po> a Pm> =pep 2 = MA Ax < n(x = Da): 
Concatenation * is defined by 


Cie se Dees D+ m> Spee Pos > Pavind® 


The length of a sequence is defined by 


WN Gita oo dO Se 
Ith < > = 


For natural numbers, we may suppose a bi-unique enumeration of all 
finite sequences of elements of N to be given, such that < > corresponds 
to zero. In our terminology we shall not distinguish between sequences and 
their numbers in such an enumeration. 


1.4, { , } is used as a lawlike pairing function onto N of type (N’)N; 
J1,J, denote the inverse pairing functions. 

In discussing sequences of type (N)C, we shall use o, t as variables for 
initial segments, i.e. o, t € |), C”. We define 


x0 Der y 
oO, ex —1)> for x Sade 


Leo =m x(0 = 0); 
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<Pos+++sPa-1> * iS a Sequence x’ with 
Yan ape teri ts 
ym=x(m—n) if m>n; 
{ D#X Spee X- 


In describing sequences and predicates, 1 will be universally used as an 
abstraction operator. 


1.5. (First-level) extensional equality between sequences x, €e(N)C is 
defined by 


Y= ¢ =pe A x(xx =e) 


1.6. Let 8 stand for an arbitrary (not necessarily decidable) species. Then 
as a selection principle (axiom of choice) we state: 


ApeCVqeS8A(p,g,y%.-..) > V¥ E(C)BA pe CA(p, Yp, x, ...). 


In case A does not contain non-lawlike parameters, Y¥ may be supposed 
to be lawlike. 
Important special cases: 


AxV yA(x, y) > VaAxA(x, ax) 
NX yA(% Ys ee aa A a ee) 
and for extensional A, i.e. an A such that 
Ao ey Yaa Aes = eee epee mT] (INN): 
we have 
AxV aA(x, a) > V b/AxA(x, Ay. b{x, y}), 
Nx yA) VC XAG yea yay 


1.7. As a general convention, we assume all non-lawlike parameters of the 
predicates to be exhibited in formulae. 


§2. Lawless and choice sequences 


2.1. We consider lawless sequences of type (N)C. Lawless sequences of type 
(N)N are discussed extensively in [2]. 
We shall assume 


VpeCVqeC(rp =c4Q). (2.1) 
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é, 4, 9, ¢ will be used as variables for lawless sequences. Intensional equality 
between lawless sequences is denoted by =. The following properties hold 
for lawless sequences: 


AoV &e€o), (22) 
e=nve xn, (2.3) 

WE Eo 40 cap bq A Se (Es gnc 5 ba 
V xAne éx(# (1, 0... +5 &,) > Wn, &0,-- +» &)) (2.4) 


where # (¢, &,..., &,) is an abbreviation for 
EF EAEFEA...AEFE,. 
From these principles one derives easily 
&=no A x(ex = nx). (2.5) 


Hence for lawless sequences = and = coincide. 
Let us use I for a lawlike continuous operator acting on lawless sequen- 
ces, with values in N. Continuity of I is expressed by 


AeV xV yAneéx(In = y). (2G) 
Let I'* denote a lawlike operator of type ((),C”)N. We shall assume 
ATV I*[AeV xI*éx # 00 Ao(I*o 40> VyAneo(ln = y))]. (2.7) 
A stronger assumption about I* is the so-called extension principle: 
AI*(AeV x0*ex 40> AxV xI*xx # 0). (2.8) 


In combination with (2.7), the extension principle asserts that every con- 
tinuous functional defined on lawless sequences of type (N)C is defined on 
all sequences of type (N)C. 
The principle of /A¢\V x-(extensional) continuity may be expressed as 
follows: 
AeVxW(e, x) > VP AeW(e, Le). (2.9) 


Analogously for /.eVa-continuity etc. In the case of more variables, 
(2.9) is to be replaced by 


(eq Ne, 2 (en, - 6) > VxW(e,,.. «85 X)) > 
ly ee eee) > Weise + bys PnCbi 920 -r End) 
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where # (€,,.-., &,) is an abbreviation of ¢, # ¢, for alli#A j, 1 ican, 
1 <j < n, and where I, denotes a continuous functional acting on n-tuples 
of lawless sequences with values in N. We can make assumptions about the 
I, analogous to (2.7), (2.8). 
In the case of lawless sequences of type (N)N, we can make one further 
assumption: 
ATVeeKAeV x(le = e&x ~ 1) (2.10) 


(see [4], [2]). Hence with (2.9): 
AeVxX(e,x) > VeAn(en 40> AcenW(e,en = 1)), (2.11) 
AeVaXx(e,a) > VeAn(en #0 > VbAcenW(e,b)). (2.12) 
2.2. Let us consider sequences of type (N)NxR, where R is a class of 
lawlike extensional conditions on sequences of type (N)N, i.e. 
ReRay = €aRx - RE, 
with y,€e(N)N. For sequences x of type (N)NxR we introduce two 
projection operators 79, 7, by stipulating for y =pep <<X_, Ran: 
TloX = Cae To Xt = Xn 
ae LG = CRO ie M1 Xyn = Rae 
2.3. We shall suppose R = |),R,; we also assume a (not necessarily 
extensional) decidable binary relation [ on R to be given. Choice sequences 


(a, B, y, 6 are used to indicate choice variables) are sequences of type 
(N)N xR which satisfy the following general a priori conditions: 


(A) Ax(x,4(x+1) D2, 0x), Ax(a,axeR,), Ax(1,ox(%9%)). 


At any stage, an initial segment &x and the conditions (A) are given, nothing 
more. 


2.4. For a clear intuitive picture we make some further assumptions: 
(B) To every RER we may associate in a unique way a condition R* 
such that 
A RERAx(R*xv — R*x), 


ARERAxE(N)N(RY AXR*¥x). 
(C) RER,AR'E€R,4.,AR' CRO 
Aa yy > x+1— (Ray > R*ay)). 
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In simple examples, (C) is often fulfilled in virtue of the stronger condition 
ROR’ > Ax(R*x > R'*x). 

(D) AReER(R*0), AREER, V x(R*2). 

(E) AaAxARER,(Ay < (x+1)R*ay > 


V Re R,4(R’ CRA Ay < (x+1)R'*ay)), 
and 
AaNx\RER,AR’G€R,,,(R’ CRAAY < (x+1)R*aya 


Ay < (x+1)R"ay > V zR'*(a(x+1) *2)). 
In fact we could have taken a weaker condition for (E): 
ANaNhxARERAAy < (x+1)R*ay > 
VzV Re R,4,(R' CRA Ay < (x+1)RGy aR(a(x+1) #2))), 


but the assumption (E) is fulfilled in the relevant examples, and moreover 
it simplifies the construction of projections in § 4. 


2.5. Some other properties which may be assumed for choice sequences 
are listed for future reference: 
(F) There exists a UeR such that 


Ax(U ER,), 
AReER(RCU), 
Axe (N)N(Uz 0 = 0). 
(U stands for ‘universal condition’). 
(G) AaAxVaeCVy Bax = BxanoB = a). 
(WH) AaeCAxVB(Ay < x(x, By = U)AnoB = a). 


2.6. A sequence = ({X9, Ro>, ++. <X_, Rn>> is called admissible (o € Azam) 
in case 
Au =n(R, DRA Au < n(R,eR,)a 


Freee Ny <x) 7 ere) An SA( Ro <xy Aca WR, (Xp 5+ +5 X_)): 


< > is also called admissible. 
As a first principle about choice sequences we have 


Ao € Ayam V “(4 € 0). (2213) 
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If = denotes intensional equality for choice sequences, we require this 
equality to be decidable: 


a= Bpvak B. (2.14) 


Let X, Y, Z be used for predicates which are zo-extensional with respect 
to choice sequences, i.e. 


XAAT &¢ = Mf > XB. 

We assume a principle of intensional continuity for choice sequences: 
X (Ol, Lg 5+ + GA F (OH, Wy... O,) > 

Voelaecan ABea(S (f, %. .a,) > X(B, % ae &,)) |. (2al) 
In many examples of notions of choice sequence 

a= Bo Ax(moax = mo PxAn, ax = 7, Bx). 
This is always true for example when to every &x one can find f, y, y such 
that &x = Bx = yx, B(x+y) # 7(x+y). The proof is given by using (2.15). 
2.7. In case (A)-(H) are valid for our notion of choice sequence, we have 
AaV'xX(a,x) > Vee KAaX(a, e(moa)) (2.16) 

(for K and notation e(«) see [4]) or equivalently 
AaV'xX(a,x) > Vee KAn(en 40> Ac(agaen > X(a, en~1))) 


(2.17) 
Similarly 


AaVlaXxX(a,a) > Vee KAn(en 40> Vahalayaen > X(a, a)) 
(2.18) 
(/\«V!a is interpreted extensionally: X(«, a)A X(a, b) > a = b). 


§3. Examples. Below we indicate some notions of choice sequence by 
specifying [ and the R,. As special conditions we introduce 


Nx =pet AX-[Az 2 x(xz = 0)], 
By =pe 4X. [Az 2 x(xz < 1)]. 
In case of B,, BX may be stipulated by: 
Bay0=0 incase y< x 


BYay  a(y—1) < 1 incase y > x. 
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Similarly for N,. Now we list examples (I)}-(VI): 


OWE UL I. Ning CN EU 


(II). R,, =per {U, Ne Bo Nya (= Ne Boy ie By, 
Nak Bo UN, alee, UU. 
(IIT). R = Ry =per {U} U {R,: a lawlike, ae(N)N}, 
wherc Ree oe kee UR Ro, UL U. 
(IV). R = R, =pere {U} u {R,: x EN}, 
where Ry =per 1X - (y = Ay . a{x, y}); @ is supposed to be a 
fixed universal function for all primitive recursive functions, 1.e. 


«jy .a{x, y}>, enumerates all primitive recursive functions. 
BSE) HS, Ei eee 


(V). Myhill’s notion of choice sequence as discussed in [3] (cf. also [4], 
10.2 (D)), which was inspired by Brouwer, can be reformulated 
so as to fit our scheme of description; see § 4. 


(VI). Like example (V), but with a restriction to primitive recursive 
spreadlaws as given by an enumerating function. 


§4. Projection operators 


4.1. We shall consider an easy example first. 
Let R contain a single condition 


Ry axa — OV yx — 1). 
We stipulate ( by RR. R* is specified by 
R*¥(x+1) = yx = Ovyx = 1. 
R*0=0=0. 


Then we have obtained the lawless sequences with values 0, 1 only. 
Now we define Proj on lawless sequences of natural numbers as follows: 
Let « = Proje, ¢ a lawless element of (N)N. 


0, 
ity = (ey —Ovey — l)vAcx — 1, 
(ey = 1), 


Te ae — sO, vax (cy — Oy eye al) Avex 


Ty &X 
Pegg — ein yy 


m™,ax = R for all x. 
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Now this is a bad approximation of our notion of choice sequence, since 
we clearly have (a denoting any lawlike sequence of (N)N): 


At Va (np Proje = a) 


for as soon as ey > 1 for some y, Proj ¢ is lawlike; and for lawless sequences 
we have \e — — Vx (ex > 1). 

Proj*e defined by z9% = Ax. sg(ex) and z,ax = R for all x, is a much 
more accurate approximation; now we have 


Asha (fo Proj* e = a). 
The projection operators we shall consider obey the conditions below 
which guarantee that a certain minimal degree of faithfulness is obtained: 
(i) Proj en is determined from én. 


(ii) If Proj en*<x, R> satisfies the general conditions imposed by (A) 
(is an admissible initial segment for the notion of choice sequence 
considered), then we can find a p such that for 9 with 4(n+1) = 
= én * p we have Proj n(n+1) = Proj en * <x, RD. 


One might express these conditions by saying that Proj ‘preserves local 
freedom of choice’. 


4.2. Let us discuss example (I). We approximate this notion by a projection 
of lawless sequences of natural numbers. Let « = Proje, 


' : ee 1) 
ie ee if (\u x(jpeue— 0) 
0 otherwise; 
; ae = 
eS | if ase = x(jzeu = 0), 
N, otherwise. 
The projected sequences are denoted by a, 8, y,.... The following strong 


form of extensional continuity holds. 
THEOREM 4.1. Let X be an extensional predicate with a single non-lawlike 
variable, then 


AaVxX(a,x) > Vee KAaX(a, e(t9%)) 
holds. 
Proor. X is extensional, i.e. 


Tot = Tio — (X(a, x) + X(B, x)). 
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Suppose AaVxX(a,x), then AeVxX(Proje,x). Using (2.10) for 
lawless sequences of natural numbers ([2], 2.4) we obtain an e € K such that 
/\eX(Proj €, e(e)). Now take any a and define x by x Sper Ax . {19 ax, O}. 
Let z be such that e~z # 0, and let x)fh¢2,az, B = Proje. Then either 
éz = zz and then X(f, e(xz)) since e(y) = e(e) or 

Ez = <{719 20, Of, .. ., {Mo au, O}, {0, yuri}, -- +» (0, 2-1} 


with y,4, # 0. Then if we take an y with 4(z+1) = xz * < {0, 1}> we have 
e(n) = e(x) and zo Proj 4 = mo Proj ¢, so X(f, e(x)). Therefore X(f, e(x)) 
holds in all cases. Next one constructs an fe K such that 

f(mpaz) = e{m a0, 0},..., {%9a(z—1), 0}. 
Hence A aX(a, f(z %)). 


To illustrate that finding a suitable projection operator is a rather delicate 
task we will change the operator defined above slightly and show that for 
the resulting notion we do not have extensional continuity. 

Define the projection operator Proj by « = Proj «if 


moa = {2 if Au < x(j,eu # 0), 


otherwise; 


: er 
—— fe is ww x( jeu # 0) 
x otherwise. 


One can check that the proof of the above theorem does not go through, 
but we will do more and prove 


THEOREM 4.2. For suitable X(a, x) we have 
s(AaVxX(a,x) > AaVxVyABl(aoax = no Bx > X(B, y)). 


Proor. Take for X(a,x) the formula V e(2)% = mo Proj eAj,60 = x). 
Clearly ¥ is extensional and A. «\V xX(a, x) holds. Suppose 


Aa\ xV yA plieax = ZoBx > X(B,»)) 
then 
AeVxV yA n(to Proj ex = mo Proj nx - X(Proj n, y)). 


Hence there is an e € K such that 


AeA n(x Proj e(j, e(€)) = mo Proj n(j, e(e)) > X(Proj n, j2e(e))). 
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Now take y = Ax. {0, 1} and let evu # 0. 
Put j,e(y) = u, and j,e(y) = uz, then for any ¢ with gu = yu 


A (to Proj Uy = Xo Projyu, > X(Proj n, 2). 
Letn —<0,0,..,0> (up times) athen 
A n(%o Proj n u, = n— X(Proj n, u2)). 
Hence for an y such that 7) Proj 7 u, = n we have 
V 9' (to Proj n = mo Proj n'a jzn'0 = uz). 

Let Y(y, n’) be the matrix of this formula, then 

Y(n, 0') @ j2n0 = u.v Vm[y' ema 

An (a! man” £1) > to Proj ny = Xo Proj’ A jrn"0 = u2)] 
(here we applied 47 = yn’ vn # 7’). 
Now choose an y such that 2, Proj yu, = n and j,y0 4 u2, then 
Val V mln’ ema An (ne man” #1) > 
fo Proj = Ko Projy’’ A jon’ 0 = ua) 
Let 1 ¢ m * <z>(= m’) then 
An" €m' (no Proj n'’ = mo Proj n), 

so there exists an m” such that 


et 


nem’ nr An’ em An em’ (to Proj yn!’ = Xo Proj y’”’). 


From this and the definition of Proj we easily conclude that 7 is lawlike. 
Thus 
AnGg Prog iy — nA jonO = us 7 x J,nx— 0) 


or 
An XG Ploy ae — 1 Ae i — 0): 


Again we can find an fe K such that 
A(t Proj n uy = nAjzn0 # uz > jonf(n) = 9) (*) 
Define xy’ = Ax. {0,u,+1} and let v be such that fy'v 4 0, v > f(y’). 
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Choose n € x'v, thenj,nf(n) = j2x'f(x') ¥ 0, which contradicts(*). Therefore 
we have 


(Aa x X(2,x) > AaVxV yA Bligax = ZoBx > X(B, y))) 
for the XY chosen above. 


Remark. We could have strengthened the result by using (2.7) and (2.8) 
instead of (2.10). 


4.3. Construction of projections. Suppose a notion of choice sequence to be 
given which satisfies (A}-(F), and for which the classes R, may be enumer- 
ated as {R[p, x]: pe C}. Further we assume R[p*, x] = U for all x and for 
a certain fixed p* eC. 

We remark that condition (E) on choice sequences implies the existence 
of a function s’’e(CxN)C (in virtue of a selection principle as indicated 
in 1.5) such that 


Aahx\ peC(Ay < (x+1)R*[p, x](ay) > 
R[s''(p, a(x+1)), x+1] C R[Ep. x] a 
Ay < (x+1)R*[s’(p, a(x+1)), x+1](@y)). 
Now suppose s’ € (Cx NxC’)C to be such that 
ApeCVreC(R[s(p*, 0, r), 0] = R[p, 0), 
ApeCAqgeCAx(R[p, x+1] CRE, x] oO 
VreC(R[s’(q,x+1,r), x+1] = R[p,x+1))). 
We construct se (Cx NxC’)C as follows. We put 
s(p*, 0, r) = s‘(p*, 0, r). 
Let s’(q,x+1,r) =p; in case Ay < (x+1)R*[p, x+-1](@y), we take 
s(q, a(x+1), r) = p, 
and in case — Ay < (x+1)R*[p, x+1](a@y) we take 
s(q, a(x+1), r) = s’’(q, a(x+1)). 


Now we define a function te (Cx N*)N such that 


{t(p, 0, u) : we N} 
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enumerates the v such that R*[p, 0](0), and in case Ay < xR*[p, xJay 
{t(p, ax, u) : ue N} 


enumerates the v such that R*[p, x](ax * 0). 
Now we define a projection operator Proj for lawless sequences of type 
(N)N xC’ as follows. If A n(en = <a én, 7, n>), then 


Proj B= Cee ’ R[p(n), n\>>n 
is given by 
p(0) = s(p*, 0, x, £0), 


Xo = t(p(0), 0, 260), 
and for v > 0 


p(v) 


Xp 


S( DVD) Xp 5s ete 1 ED) 


t(p(v), <Xo 9.9 8 0H Xy-1)s Tio £U). 
4.4. A lawlike function of type (N)N is said to be a spreadlaw if 

a0 # OA AnV x(an # 0 a(n #2) £ O)A 

Anf m(a(n*m) 4 0 > an # 0)A An(an = Ovan = 1). 

We write ae Spr to indicate that a is a spreadlaw, and y € a is an abbrevia- 
tion for /x(azx # 0). For spreadlaws a and b we define a&b by 
An(an # 0 > bn ¥ 0). 

In Myhill’s presentation in [3], which was inspired by Brouwer, sequences 


of pairs <x,, a,> with x,¢N and a, eSpr are considered. These sequences 
must satisfy the conditions 


(i) +1 S a, for every k, 
and 
(11) dy <Xos 2 ke & O for every & 


As it stands, this notion of choice sequence is not ‘basic’, since the spread- 
laws do not constitute a decidable subclass of the class of lawlike functions, 
and moreover, € (which should play the role of [[) is not decidable relative 
to spreadlaws. 

Therefore we describe a modified notion which is ‘basic’ (i.e. meets our 
general requirements in section 2), and which is closely related to Myhill’s 
notion. 
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First we remark that we can codify a spreadlaw by an arbitrary lawlike 
function (cf. [1], p. 181). Let @ be a lawlike function of type (N)N, then a 
spreadlaw c is defined by 

Gy) = il, 


o(n #8) = sg en-sg(jza{n, x}+(1=|x—j,anl)), 


where sgx = 0 if x = 0, sgx = 1 otherwise. Note that the j,a-part 
enumerates choices permissible after 7, and that the j, a-part makes certain 
the existence of at least one choice after n. Let ©p be the (primitive recursive) 
operator which associates c with a as given above. Conversely, to each 
spreadlaw c € Spr we can find a lawlike a such that c = @ga: let d be the 
choice function existing in virtue of An\V x(cn = 1 — c(n* &) = 1), define 
Givi} — c+ x), and 4,7 —sdn,, thenii.a — 1a), a5; — AXx{a, X, a,x}, 
®,a =c holds. (Thus we have constructed an operator Y such that 
@,%c = c for ce Spr.) 

We also have a device for codifying subspreads of a given spread. Define 
d' from a spreadlaw c by 


Gia, oe c(t (1 (nl # x) * cn, 


where c’n = min {x: c(n* %) = 1}. If c = ya, d’ is primitive recursive 
in a, For every n, Ax . d'{n, x} enumerates all possible ‘next choices’. Now 
we define d primitive recursively from 5, a by 


d0 = 1, 
d(n #&) = sg dn« sg[c(n * &) - sg j, b{n, x} +(1=|x—d’ {n, j, bn}|)], 


where c = ®ja. Let ® denote the operator which constructs d from ce, b, 
then d = (Goa, b). We put 


LD (Op ora 5 oy Cy) OO al (eee ee 
Our notion of choice sequence is now described as follows. Let 


ee pee Seon iG) Gos, lawiikeU {U}, 
where 
Rae ee — ne nie Odo, a,)l, RE = U. 
C is defined by 
Riao,..-,4,> (R<bo,...,b,> iff vtl=u and Aw< v(a, = b,), 
Eecdg ys: ., daa U, UU. 
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C is clearly decidable. Take C’ to be the class of all finite sequences of 
lawlike functions, and let C’ be the class of all lawlike sequences together 
with 0. We index R,, in this case by stipulating: 


R[¢ >, x] = U, (p" =< 9); 


IRE le 9 0 8 66 in case u = x, 
R[ <a p Jarkt olich J ay | = <4o 4 2 . 
U otherwise. 


The function s’’ we may take in this case to be defined by 


ae 120, sO 4)) = < aie He plicelat, 


s'’ = 0 in all other cases 
(@(a, Ax . {1, 1}) = a). For s’ we stipulate 


S(< >, 0, Ona 

5(< >.x, a= "<a> ter all, 

5 (<dos7 5 Ck ly | ar ee my 
s‘(p, x, r) =< > in other cases. 


Now the projection-operator defined on lawless sequences of type 
(N)N xC’ may be constructed as indicated in the previous subsection. 


4.5. In the case of example (VI), we simply take all primitive recursive 
spreadlaws enumerated by a universal function; note that oa, B( Poa, b) 
are primitive recursive in a, b. 

For the enumerating class C’ one may take N in this case, and one obtains 
an approximation by a suitable projection of lawless sequences of type 
(N)N. Details, which are similar to those in the case of example (V), are 
left to the reader. 


§5. A «\ x-continuity. From theorem 4.2 we saw that /« V x-continuity 
is not always valid for projected sequences of the types considered. In fact, 
there are many more cases in which /\«\ x-continuity can be disproved; 
adapting the trick from theorem 4.2 to the choice sequences themselves 
we succeed for example in disproving /\«V x-continuity for our example 
(II), taking for X(«, x): 


V B(toe = moBAL(x = OA, BO = U)v(x = 1A 7, 80 = Bo) 
v(x = 2An,f0 = No)))). 
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Similarly, /\«\V a-continuity may be disproved for example (V) by taking 


X(a, 4) =per V Blt % = MB 
A(x, BO = Ay. ye ®(a) v(a = Ax. 1A7, BO = U))]. 


§6. Extensional continuity. In this section we will prove extensional con- 
tinuity for an extensive class of notions of projected sequence. We shall 
employ (2.6)-(2.9) in the form 


AsV xX(e,x) > VPA eX(e, Le), 


where I" is a continuous operator (¢ lawless of type (N)C’). 

In the following x,&€ will be arbitrary sequences of type (N)C’ and 
é,n,¢,9 are lawless sequences of type (N)C’. a, B, y denote projected 
sequences. We now list some properties of Proj. 


(1) Axl A x(x, Proj xx = U) > AaAy(aeé x Proj xy > 
V e(éy = XV AN Proj e = a))], 

(2) AxAeVaV n(éx = 9x AX Proj yn = a), 

(Cla) yin, Proj ye = axe, Proj yx =), 

(4) AnAEAx(En = gn > Proj En = Proj yn). 


Lemma 6.1. Assume (1)-(4) and let X(a, x) be extensional, then 
AaV!xX(a, x) > AaVixVyA B(xoay = moby > X(B, x)). 
Proor. Suppose A aV!xX(a,x). Then we have /\eV!xX(Proj ¢, x), 


hence there is a I such that 
(A éX(Proj ¢, Fe). 
Moreover 
ne, JARO) = ap ECO yg oe (5) 
Take any « = Proj e. Then we can find a y such that 
Ae ra Ptayx — 1)0x Am, Proj yx = Ul 


(by (3)). We suppose I to be defined on all functions of (N)C’. There 
exists an 1 such that 


ry =x A&(En = yn > Fe =x). (6) 


180 D. VAN DALEN AND A. S. TROELSTRA 


Now suppose 
B = Proj€, 


ToBn = To4n = Mo Proj yn. 
Then we can find x’, n’ such that 
Te=x'oAN&En’ 3 ln' 5 TE = x’). 
We may suppose ’ 2 n, Furthermore there are m, z such that 
Fe=zo A&(Em = im TE = 2), men’. 
Now there exist by (2) e,, a such that 


&,m = EMATN, Proj g; = a, 
hence 
LON I 15 


By (1), we can find an 7 such that 
{im = AMAT Proj yn = a, 
since a € 7, Proj ym. Also 
Dy 


(since m > n). Furthermore 
Pe = If ye = Ip 

On account of (2), there are 9, b such that 

En’ = 9n' Ang Proj 9 = b 
and because of (1) 3, exists such that 

Gn = {nA Proj 9, = b 
(since b € zy Proj yn; this follows from 

b € Mo Proj €n = no fn = moan = To Proj yn). 

Therefore by (10), (7) and (11), (6) 


rt =F9aTy=T9, 


BX 


(7) 


(8) 


(9) 


(10) 


(11) 
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hence using 2%) Proj 9 = zg Proj 9,, (8) 
asia — [eee 
Therefore with 75 fn = moan etc. 
AaVixX(a,x) > AaVixVyA Bay = By > X(f, x)). 
Remark I. A completely parallel argument may be given to prove: 
AaVlaX(a,a)> AaViaV yA B(toay = toBy > X(x, a)) 


under the same conditions. 

Remark IT. In the conditions (1), (2) of the lemma, the quantifiers (a, V a 
may be restricted to a species Ysay \aeY, VaeY. This does not alter 
the proof. 


LEMMA 6.2. Suppose the conditions (1)}-(4) for our projection operator to be 
fulfilled, and let an operator I of type ((N)N)(N)C’ be given which transforms 
Mo& into a sequence ¥ =p, I’m oa which satisfies the requirements of con- 
dition (3). Then for projected sequences 


AaVixx(a, x) = VeAaX(a, e(aoa)). 


Proor. Let I be the operator of type ((N)C’)N introduced in the proof 
of lemma 6.1. Since I is defined for any x of type (N)C’ (on account of 
the extension property), I is certainly defined for all I’’e, ¢ lawless of type 
(N)N. Therefore (formula 2.10) 


Vee KAxe(N)NUT’y = e(x) 
and hence 
A aX(a, eto). 


THEOREM 6.3. Let a notion of choice sequence be given for which (A)-(H) 
are satisfied, and let Proj be a projection operator corresponding to this 
notion which is constructed as indicated in subsection 4.3. Then for the 
projected sequences: 


AaVixxX(a, x) = VeAaX(a, e(m%)). 


Proor. Assume for simplicity the C in (G), (H) to be the collection of all 
lawlike sequences. We have to verify the validity of (1)-(4), in order to be 
able to apply the previous lemma. The validity of (4) is immediate. (1) is 
derived from (H) as follows. Let x satisfy A x(x, Proj yx = U) and 
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a € Tp Proj yy. According to (H) there exists a choice sequence f such that 
Az < y(n, Bz = U)An B =a. 
Hence by 2.15 for some g€ Azan 
ApeolAz < y(2, Bz = U)AnoB = al. 
Suppose Ith o > y. Then for any eo such that A x(Ex € Aja) we have 
Vz jez = UAnge — a 


For suppose uv > Itho, then we can find a Beo such that Be E(u+1). 
Then 2) 8 = a, so 19Bu = au = mou; hence 17)9¢ = a. By our method of 
constructing Proj, V e(Proj e € a) holds. Therefore 


Az < y(x, Proj ez = UAnp, Proje = a), 

and thus 
V e(éy = {VA NM Proje = a). 
(2) is derived from (G). We have to prove 

AxAeV aV n(éx = 9x A No Proj y = a). 

Take any éx. (G) can be rewritten as 
Ao € Ayam V B(B EG ATP = A). 
Take o = Proj ex € Aygm- Then 
V B(BeEonnoB = a), 
and hence for some o’ (o’ an extension of c) 
Ayeéeo (toy = a). 


By the preservation of ‘local freedom of choice’ for our projection operators 
we can find an y such that 7x = éx and Proj neo’. Hence 2, Projy =a 
(by a reasoning analogous to the argument given above). 

(3) is satisfied automatically if we construct a I’ as required in lemma 6.2. 
I’ is constructed by giving a simultaneous recursive definition of certain 
continuous operators I’ e ((N)N)(N)N, I’ € ((N)N)(N)C’, such that I’ 
is then given by 


AY SRT yx, ex 
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Let € denote a sequence of (N)N. We can find ry € C’, and a lawlike function 
ce such that 


R[p*, 0] 


Zz 


R[s(p*, 0, ro), 0] 
t(p™, 0, cz) 
Let P, denote the class {p: R[p, x] = VU}. 
Now we may define a lawlike function @ such that 
R[p”*, w] = R[s(p, aw, e(Gw, p)), w] 


for w > 0, pe P,, (we have made use of a selection principle here). Also 
we suppose ¥ to be such that 


x = t(p, aw, ¥(p, aw, x)) 
for w > 0, peP,,. 
Now let FP’ € = pw, ’'’€ = v. Then I’, I’ are defined by: 
Po =D’; 0) = Fins i = e208 
Pi = (po, HA, v0), vl = g(H1, po), = wl = 'P(pT,, fil, €1); 
iP, Vit), svn — olin pry), in — Vp. jan, én). 


The verification that I has the required properties is now straightforward. 


§7. Approximation relative to a restricted language. 


7.1. As we have seen, projections approximate the corresponding choice 
sequences to a certain extent, but not completely. However, we might ask 
if projections associated with a certain notion of choice sequence yield 
better approximations if we make suitable restrictions on our language. 

Let us consider a simple example as an illustration. Let A denote the 
continuous operator on sequences defined by: 


Ay = Ax. x(x +1). 


If e,4,... are lawless sequences of (N)N, we may consider the sequences 
Ae, An, .. .; im many respects, these sequences behave like lawless sequences, 


although not entirely so. For if we use «, B,... as variables for A-trans- 
formed lawless sequences, we have 

Aa e(a = As), 
whereas 


— AnV en = Ae) 
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(even more, Ane —(n = As), as is readily proved). Therefore these 
A-transforms are certainly not lawless sequences. One verifies easily 


& = +> Ae = An. 

Continuity does not hold. Take (a, x) to be: 
X(a, x) =per V e(a = Aeacd = x), 
then Y is extensional, and also \aV!xX(a, x), but not 
AaV xV yA Bax = Bx > X(B, y)); 


this is easily refuted. But this disproof of A « V x-continuity uses a property 
which refers explicitly to the lawless sequences from which the A-transforms 
were obtained. 

We may suspect that Aa\x-continuity would hold in cases where 
X(a, x) does not explicitly refer to the original lawless sequence. This turns 
out to be true, in a sense which will be explained below. 


7.2. Let & denote a second order language with variables for lawless 
sequences of type (N)N: ¢, n, ¢, 9; variables for numbers: x, y, Z, u,v, W, n,m; 
variables for lawlike functions: a, b, c, d; function constants for certain 
primitive recursive functions, like 0, *, +, - etc.; a constant predicate K 
for lawlike functions, and a binary predicate = for numbers. The logical 
operators are those of three sorted intuitionistic predicate calculus: 
A, Vv, 7,27, Ax, Vx, As, Ve, Aa, Va. 

Terms, atomic formulae and formulae are defined as usual; let Tm denote 
the class of terms, Fm the class of formulae. 

Let I be a continuous operator of type ((N)N)(N)N. Fm(L) consists of 
formulae which are obtained by replacing in each Fe Fm every occur- 
rence of a lawless variable by its I’-transform, i.e. ¢ by Ie. 

Now the following two assertions may be proved* for [= 4, =T, 
(definition follows), under the condition e = 7 @ lye = Fyn: 


(i) For any X e€ Fm(I) 
X(@, €9;--+,8,) > 116 = beg Vs. V be — ery ninete nen 
AnUinenaly € Bega... Aln # Te, > XH, &. «. -, en) 


* We are indebted to professor G. Kreisel for a hint towards the proof. 
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and the principle of '-continuity: 
(ii) for any Xe Fm(L), 
AeVxX(e,x) > Vee KAeXx(e, e(Le)). 


For any lawlike b, I, is defined by first introducing a lawlike function g, 
defined from b by primitive recursion: 


g0 = 0, 


g(n * &) = bf{gn, {x, Ith(n)}} 
and then putting 


P,x = Ax . g(X(x+1)). 
The condition 


Eide — 1p ti 
is ensured when 


AnVmVxVyg(n*xm#k) 4 g(n*m+#). (1) 


A simple case of an operation I, is the I’ defined by an (arbitrary) c such 
that 


JP) se eres. 
In order to ensure (1), we may require 
Prov) 22 (lyaz) cly.2 }Ay > x). 


We expect that much more general results in this direction can be proved; 
we hope to return to this matter in a further installment. Therefore we omit 
proofs. 


7.3. Let us make a simple application to our example (I). Let for any x, 
k,x be defined as [x/2] (the maximal n such that 2n < x). We put 
k,x = x—2k,x. Now we define Proj, by: 

Hoyexameiiens/ \ sa x(k jeu == 1); 


gy en es | ; 
0 otherwise; 


U if Au < x(k,eu = 1), 


Te elol ee — | : 
INE otherwise. 


By a simple adaptation of the proof of theorem 4.1 we see that we can 
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prove continuity in the form: 
AaV!xX(a,x) > Vee KAaX(a, e(a)). (2) 


Moreover, we see that in case X is such that X(a, x) X'(9q, x), 
X'(e, x) € Fm, (2) can be proved from 


AeVxY(e,x) > Vee KAcY(e, e(e)) 
for Ye Fm. Now, if we define I, by 
I',ex = 2j,ex+sg(jzex), 


then I’,-continuity is valid. 

Remark that Proj, [, is exactly the representation as defined in §4 
(theorem 4.2). Because of I',-continuity and the remark made above, it 
follows that 2) Proj, [',-continuity holds, i.e. 


AaVxX(a,x) > VeeKAaX(a, e(tox)) 
holds for Proj, ',-projections if XY is such that for some X’(e, x)e Fm 


X(a, x) > X'(m%, x). 
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ON SUBJECTIVE MATHEMATICAL ASSERTIONS* 


B. VAN ROOTSELAAR 


According to Brouwer, mathematics is an individual mental activity and 
for that reason we need not be surprised that Brouwer’s intuitionism shows 
a certain subjective aspect. This aspect appears particularly explicitly in 
his papers [1], [2], where he defined choice sequences depending on the 
experiences of the mathematician defining them (the creative subject). 
These examples will be the starting point for our discussion. 

Brouwer’s prescription of the creative subject’s choice activity is quite 
unambiguous and runs essentially as follows: 

1. The creative subject should take in mind a non-tested mathematical 
assertion 2, ic. a mathematical assertion for which no proof is known of 
Av = 1. 

2. Then he creates a choice sequence « = {a(n)} according to the following 
prescription: as long as he has experienced neither the truth of — 2% nor 
that of — — % he chooses «(n) = 0: but as soon as he has experienced the 
truth of Q after the choice of «(r—1), but before that of a(r) he chooses 
a(r+n) = 27’ for all n; and as soon as the truth of — Y has been experienced 
between the choices of a(s—1) and «(s) he chooses a(s+n)= —27° for all n. 

Then the sequence « converges and determines a real number a. At the 
moment we are not interested in the use Brouwer made of this construction, 
but concentrate on its subjective aspects. 

As early as 1949, D. van Dantzig emphasized the ambiguity in Brouwer’s 
terminology and tried to give subjectivistic as well as objectivistic inter- 
pretations of his terminology. In particular he proposed the subjectivistic 
interpretation of the term mathematical assertion as one of which the 
creative subject never will be in doubt as to whether it has or has not been 
proved by him. 

This interpretation essentially has reappeared in Kreisel’s more formal 


* Revised Nov. 1, 1968. 
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treatment in his [6] as one of the axioms for the creative subject (formula 
(1) below). 

In that paper G. Kreisel took the step of using Brouwer’s notion of 
creative subject in a formal way in order to derive purely mathematical 
results. He deviated from Brouwer, who essentially recognized only one 
creative subject, by taking into consideration the existence of many (more 
than one) creative subjects. 

His fundamental notion is 

2 pe ol 


to be read as: the creative subject X has evidence for asserting A at stage m. 
Instead of Kreisel’s 2 |, 2{ we shall use in the following 


P(c, m, 2) 


where o is a variable for creative subjects. 
In [6] Kreisel accepted the following axioms: 


(1). P(c, m, X)v — Plo, m, 2) 

(2). WY (¢) = — (Em)P(a, m, 2) 
(o)[(Em)P(o, m, 2) > 4] 

In [9] the present author proposed 

(39) Y > (Ec)T(a, 2) 


meaning: 2 is true if and only if there is a creative subject o, who has 
experienced the truth of A. The reference to stages was left out (T(a, 2) = 
= (Em)P(o, m, %)), and in comparison to (2) there is equivalence instead 
of implication. Subsequently in [10] A. S. Troelstra rendered Kreisel’s 
axioms as follows: 


P(o, m, 2) v - P(e, m, 2), 
WY (¢) = 4 (Em)P(a, m, YW), 
(3). (Ec)(Em)P(a, m, 1) W. 


So the second part of (2) has been reinforced to (3) (similarly to (3’)). The 
first axiom of (2) is equivalent to 


(2a). — (En)P(a, n, XL) > — UL, 


B XI ON SUBJECTIVE MATHEMATICAL ASSERTIONS 189 


Kreisel’s axiom of christian charity, as reported by J. Myhill in [7], which 
seems to formalize a well-known argument of Brouwer’s: 


(4). if the creative subject has evidence that he will never assert YX, then 
he has evidence to assert — YX. 


We shall return to this below for the motivation of axiom (6). First some 
observations are made in connection with the representation of subjectivity. 

If the creative subject enters explicitly into the formalization of Brouwer’s 
example and if at the same time we do not restrict ourselves to one creative 
subject, the resulting sequence in the example may be denoted by 


{a(o, W)n}, 


and its limit by a(a, 2). If this concept is to be really subjective, we should 
not have 


a(a, U1) = a(c, W) 


for all o and t. However we are not able to derive a mathematical assertion 
to this effect, without making a further assumption. Note that in general 
one objects against Brouwer’s definition on account of the fact that there 
is no proof for 


(o)(z)(m)(a(o, U)(n) = a(t, W(n)). 


If we recall that 
a(a, U)(n) = 0 — P(a, n, Av — AY), 
(n)a(o, W(r+n) = 27" Pia, r, 2), 
(n)a(o, W(stn) = —2°*— Plo, s, — %), 


we see that (En)(a(a, A)(n) # a(t, U)(n)) implies a(o, A) # a(t, WM), hence 
if we assume a(a, 2) = a(t, 2), it follows that 


(n)(a(o, Wn) = afc, W(n)); 


(n)(P(o, n, 2) — P(t, n, 21), 


consequently 


an assertion we should like to deny, e.g. by 
G6), — (0)(t)(n)(P(o, n, 20) > P(t, n, 20), 


for non-tested 2{. This formula expresses a minimal amount of freedom of 
deduction for creative subjects. 
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A formulation of non-testedness of Y{, implicit already in van Dantzig 
[4], is 
nontest (a, n, 2f) 4 — P(e, n, (Em)P(o, m, =% v — —2%)). 


On the basis of this definition it is clear how we are to ensure the intended 
minimal freedom of deduction for creative subjects and so obtain provable 
subjectivity (in various degrees). 

Just as it seems to be reasonable to require a minimal amount of freedom 
of deduction, it seems reasonable also to put an upper bound on this freedom. 
In other words the creative subject should accept evidence on account of 
his having accepted other evidence, which leads one e.g. to assume 


P(g, n, XA B)  P(a, n, W) Aa P(e, n, 8B), 
further: 
if P(o, n, 2) and P(o, n, X + B) then P(a, n, B) for all Wand B, 
and 
P(a, n, 1 + B) > P(a, n, 2) > P(e, n, B). 
Also 
if P(o, m, (n)(Q(n) > B(n))) then P(e, m, (n)WM(n) > (n)B(n)), 
and 
if P(c, m, (n)(Q(n) > B(n))) then P(e, m, (En)W(n) > (En)B(n)).* 
Remark. The assumption of 
if &— B then P(a, n, 2) > P(a, n, B) 
or the assumption of 
if P(o,n, M) and A > B then P(o, n, B) 


for arbitrary { and % is not advisable on account of (3). However it seems 
to be useful to assume one of these rules for certain subclasses of formulas, 
e.g. for the theorems of intuitionistic predicate calculus (not involving P 
or ¢). 


* These five formulas will be referred to as ‘the deduction rules for P-formulas’ and em- 
ployed tacitly in what follows. We shall also assume that if 9{ and % are definitionally 
equivalent, then P(o, m, 2 << 8). 
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Another possibility is to assume P(o, k, 2) for all o and k for these 
formulas. For the present we shall not make such a general assumption, 
but confine ourselves to some ad hoc assumptions in each case. The assump- 
tions preceding this remark may be considered as such. 


To find a basis for our system we look for a substitute of Kreisel’s (2a), 
and indeed the proper relativization of (4) seems to be 


(6). P(c, m, (n) P(e, n, 2)) > P(a, m, 2). 

In fact, in the following we consider as a basis the axioms 

(1). P(o, m, 0) v — Po, m, 0) 

(3). (Eo)(Em)P(o, m, X) <-> MX (a and m not free in 2) 
(6). P(o, m, (n)—P(o, n, 2)) > P(o, m, —2). 


We shall not elaborate the system in detail, but mention additional axioms 
in case they are necessary for our discussion. 

The resulting system is considerably weaker than Kreisel’s, but seems to 
obviate some objections made by S. C. Kleene to Brouwer’s relevant 
statements (cf. Kleene and Vesley [5], 174-176). 

One of the features of Kreisel’s system is the derivability of Kripke’s 
schema. Whether this is considered attractive is a matter of taste; one might 
also prefer to drop the system altogether and merely adopt Kripke’s schema 
instead. We shall not carry out a comparison between these possibilities, 
but see what remains of the derivation on our basis. 

Kripke’s schema is usually formulated as follows, where « stands for a 
free choice sequence: 


(Ex)(((n)a(n) = 0 — 9%) » (En)a(n) # 0 > %) 


On Kreisel’s basis, this is simply derived as follows: 


Put 
a(n) = 0 — P(o, n, 2%), 
a(n)=1le P(o,n, 2). 
Then 
(En)a(n) = 1 — (En)P(o, n, 1), 
Ye) 


(En)a(n) = 1 > O. 
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Also 
— (En)a(n) = 1+ — (En)P(, n, 2), 
so 
(n)a(n) =O — A. 


On our basis we have to proceed a little differently, already from the 
beginning. 
First of all we should not start with 


a(o, U)(n) = 0+ — P(o, n, YU) 
a(o, W(n)=1<+ P(o,n, %), 


for since this is to be looked upon as a definition valid for all o, at least o 
has every evidence to assert the above equivalences for every m. So we 
should start with 


P(a, m, (n)(a(a, U)(n) = 04 — P(a, n, 2))), 


and 
P(a, m, (n)(a(o, W(n) =1< P(e, n, %))), 


for every m and every o. Then on account of the deduction rules for 
P-formulas 


P(a, m, (n)a(o, )(n) = 0) + P(a, m, (n)—P(a, n, 2) 
so on account of (6) 


P(a, m, (n)a(o, 20)(n) = 0) + P(o, m, —2). 


and by (3) 
P(a, m, (n)a(a, W)(n) = 0) > 2, 
(Eo)(Em)P(a, m, (n)a(o, U)(n) = 0) + AA. 
Also 
P(a, m, (En)a(a, U)(n) = 1) + P(o, m, (En)P(a, n, 2), 
so 


(Eo)(Em)P(o, m, (En)a(o, U)(n) = 1 (Ea)(Em)P(a, m, (En)P(o, n, 2). 


A further reduction of these formulas depends on the logic that is assumed. 
We have in mind intuitionistic logic, which seems to be a quite natural, 
although by no means the only possible, choice. 
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Then on account of (3) from P(a, m, (En)P(a, n, 2)) follows (En)P(a, n, 2), 
hence %*, so we arrive at 
(Eo)(Em)P(o, m, (En)a(o, 1)(n) = 1) > UL. 
So much is clear, that Kripke’s schema is not self-evident on the present 
basis. 
If we adopt the converse of (6) 
(6a). P(a, m, 2) > P(c, m, (n)P(a, n, 20), 


then we have 
P(a, m, (n)a(a, W)(n) = 0) + P(c, m, 4%), 
and consequently 
(Eo)(Em)P(o, m, (n)a(c, 2)(n) = 0) AY, 


and our result has more resemblance to Kripke’s schema. 
Let us next apply the idea to some of Brouwer’s examples. Suppose 2 
is non-tested by o at stage k (fixed), i.e. 


— P(a, k, (En)P(a, n, =A v 5 —2%), 
then define a(c, %) by 
(n)(a(c, W(n) = 0 — P(a, n, =X v = 42)) 


and 
(n)(a(o, W(n)=14 P(o,n, “Xv = AY) 


i.e. put 
P(a, p, (n)(a(o, M(n) = 0 — Pla, n, “A v — -2%))), 
P(a, p, (n\(a(o, D(n)=14 P(e, n, “2% v — 72%). 


Then if 
P(a, k, (n)a(a, 2)(n) = 0), 
also 
P(a, k, (n) 5 P(o, n, =X v — 2%); 

so by (6) 

BG (0) 
Since by (3) 

— (Eo)(Em)P(c, m, 5 (A% v — 2), 


* D. Scott pointed out a redundancy in the first version. 
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it follows that 
— (Eo)(Em)P(a, m, (n)a(o, 20(n) = 0), 


i.e. nO o can ever prove (n)a(o, %)(n) =0 for his choice sequence «(a, 2). 
Next if 
P(a, k, (En)a(o, )(n) = 1) 
then 
P(o, k, (En)P(c, n, =U v — 2%), 


contrary to the hypothesis, so 
— P(a, k, (En)a(o, &)(n) = 1), 


i.e. o has no proof of (En)a(a, 2)(n) =1 at stage k. If 2 is non-tested for 
all o at stage k, then no a has such a proof at stage k. It is not reasonable 
to expect more, because the non-testability of 2 at stage k does not preclude 
the possibility of 2 being tested at some later stage. Actually from (8) 
follows that it is impossible that 2{ is non-testable for all o at all stages. 

In another example Brouwer uses an % which depends on a choice 
sequence parameter f. Let 2(f) be such that 


(Ef)U(S) & (Ef) =U); 
of course this should make sense to o, so the hypothesis should read 
P(o, k, (Ef)US) & (Ef) sUP)). 

Then if we put again 

P(a, k, ao, f)(n) = 0 — Po, n, Uf) Vv = AUF), 
and 

PO, k,a(o,f)(n) =1<> Pla, n, =A(f) v = AY), 
we find similarly 

— (Eo)(Em)P(o, m, (n)a(o- f)(n) = 0) 

Next suppose 


P(o, k, (f)(En)a(o,f)(n) = 1), 
then 


P(a, k, (f)(En)P(o, ny Spl Vie —AU(f))), 


hence if o is convinced of the fan-theorem and fis understood to range over 
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a finitary spread, we have 


P(a, k, (En)(f)P(o, n, =U(f) v — AU(f))). 


This is contradictory if o also accepts Brouwer’s corollary that the con- 
tinuum cannot be split, so we obtain 


— (Eo)(Em)P(a, m, (f)(En)a(o, f)(n) = 1) 


(Note that Myhill in his [8] criticizes the use of the fan theorem, because 
it is used in an intensional context. Our context is a little different from the 
one criticized by Myhill). 

The above results do not seem to justify Brouwer’s claims that he has 
proved classical mathematics to be contradictory. As far as I know, no one 
besides Brouwer has ever seriously believed so. 

In this connection I wish to point out some consequences of another 
axiom (which in a more systematic representation might be dependent), viz. 


(8). P(t, k, (a) M(c)) — P(t, k, U(p)). 
If we consider the result of the first example 

— (Eo)(Em)P(o, m, (n)a(o, W)(n) = 0), 
or 

(o)(m) — P(a, m, (n)a(a, 1(n) = 0), 
then we have 
(Et)(Ek)P(t, k, (¢)(m) — P(o, m, (n)a(o, 20)(n) = 0)). 
So by (8) 
(Et)(Ek)P(t, k, (m) — P(t, m, (n)a(t, 2(n) = 0)). 

SO 

(Et)(Ek)P(z, k, — (n)a(t, 2)(n) = 0), 
from which follows 

(Eo) — (n)a(o, U)(n) = 0. 


A similar reduction is possible for both resulting formulas of the second 
example. 

Another application of the axioms for creative subjects consists in the 
possibility of describing quite naturally different notions of choice sequences 
in terms of the knowledge about them possessed by the creative subject. 
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A variety of conditions may be imposed, some of which we mention, 
without pursuing the study of sequences which satisfy such conditions. 


1. AF(a, o) <+(n)(m) — P(e, n, a(n+1) = m). 
Note that from AF(«, a) follows —(En)P(q, n, (m)o(m)=0) (cf. Myhill [7]). 
2. AF(«) < (c)AF(a, 0). 
3a. Lo(a, o) << (n)(Em)P(o, n, a(n +1) = m). 
3b. L(a, o) «+ (k)P(o, k, Lo(a, o)). 


4. Lo(a) (a) Lp(a, o) & (m)(n)(c)(t)(P(o, n, o(n+1) = m) 
<> P(t, n, a(n+1) = m)). 
5. L,(a) + (n)(Em)(c)P(o, n, a(n+1) = m). 


Such notions (formulations) seem to be useful for the problems mentioned 
in e.g. Myhill [8] and Troelstra [10]. Observe that in the approach outlined 
above one does not encounter difficulties which might call for an assumption 
like the arrangement of proofs in a sequence, as mentioned in e.g. van 
Dantzig [4], Myhill [7] and Troelstra [11]. Questions as to what every 
creative subject o should be able to deduce (as encountered by Troelstra [11]) 
may be handled on the basis of our remark. 
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A PALATABLE SUBSTITUTE FOR KRIPKE’S SCHEMA 


Rees VESEEY 


1. Introduction. We are concerned with certain intuitionistic refutations of 
theorems of classical analysis. Some of such refutations rely only on the 
intuitionistic but non-classical continuity condition attached to a Vw3x- 
prefix (‘Brouwer’s principle for numbers’, [5] 7); since Kleene’s investigation 
and formalization these seem well understood. But others, such as the 
refutation of («)(« = 0 > w + 0) for w ranging over real number generators 
((2] 8.1, [5] 17-18), rely on Brouwer’s method employing sequences 
developed by a ‘creating subject’ who looks for the solution to some un- 
solved problem. These have been slower to yield to formalization. One way 
to deal with them has been proposed by Kripke. We shall describe another 
and much less bold way. 

We shall not repeat here the details of Brouwer’s method, for which one 
should read not only the references and discussions in [2] and [5] cited 
above but also Kreisel [6] and Myhill [7], [8]. 

Nor shall we discuss in any detail Kripke’s proposed axiom schema 


(cf. [6], [7], [8]): 
KS IB{[VxB(x) = 0 ~ — AJ} & [AxB(x) # 0 = Al}, 


notation as in Kleene’s system I of [5], with A an arbitrary formula possibly 
containing free function variables. 

Let I* be I with *27.2 (Brouwer’s principle for numbers) replacing 
*27.1 (Brouwer’s principle for functions, or Yeif-continuity) as axiom 
schema. The addition of KS to I* seems to make possible the derivation of 
all the refutations under discussion here. This has been verified by Hull [3] 
for those results written out in [2]. 

Why should one be interested in an alternative approach to the refutations? 
These are some reasons: 

(a). One may wish a system retaining Brouwer’s principle for functions 
(* 27.1). As emphasized by Myhill [7], this form of continuity is inconsistent 
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with KS. It seems to be true that Brouwer never explicitly asserted this 
principle, but under one interpretation of intuitionistic intention one would 
expect every assertion (a)(Ef)A to be short for some assertion («)(E!f)A’, 
where A’ > A. And continuity for the latter follows from Brouwer’s 
principle for numbers ([5], p. 89). So Kleene was led to *27.1 as a way 
to make this interpretation as explicit as possible in the notation of I. 

(b). In an intuitionistic theory of species of higher order, one can be 
even more explicit. Here, as Myhill has observed, it seems natural to 
postulate 

(S){(«)(EB)S(@, B) > (ES')[(S’ S S & (a)(E!B)S'(a, B)]}, 
which contradicts KS. 

(c). Even if KS is weakened in that A is permitted no function variables, 
it still allows the derivation of formal theorems Ja£, FE containing « as only 
function variable, for the intended interpretation of which the a cannot be 
recursive. So under even this weak KS one misses a property which on 
Church’s thesis one might expect for an intuitionistic system (and which I 
possesses, by theorem 9.3 of [5]). 

On the other hand, for those who accept KS our formal work below may 
have some interest in verifying explicitly that Brouwer’s refutations do not 
use the full power of KS but can be obtained from a weaker principle. 


2. A new schema 

2.1. Suppose we want a system for intuitionistic analysis embracing the 
Brouwer refutations and the axioms of I (and hence not including KS). 
One possibility of course is simply to put in separately each refutation as 
an axiom or axiom schema. E.g. we would postulate — Va{— Vxa(x) = 
= 0 > dxa(x) # 0}. Kleene’s investigations in [5] 17-18 of several of these 
results have shown them to be independent and his arguments using realiza- 
bility and special realizability also show that these at least can be added 
consistently to I. (Note that intuitionistically if independence of A means: 
not Fk, A and not +; — A, then independence of A does not imply that 
— A is consistent with S; one needs also not ts — — A.) 

A more attractive possibility is to find a single axiom schema from which 
all the desired results would follow and which itself would be consistent 
with I. Since KS holds in some versions of intuitionistic analysis, it might 
provide intuitionistic support for the new schema if it could be deducible 
in I* from KS. Other support for it in the intuitionistic literature would be 
desirable also. 
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We now describe such a schema. It is related to the familiar principle of 
classical analysis: every continuous function f defined on a dense subset S$ 
of the continuum can be extended to a continuous function f* defined on 
the entire continuum. We restrict the dense subset to be negative, i.e. there 
must be a predicate A(«) such that «eS = A(«). We omit the continuity 
conditions on f, f* (f* is continuous anyway if completely defined). And 
we should prefer not to use the words above either, but rather to avoid any 
suggestion of a notion of a function defined not on the entire continuum 
but only on a (possibly undecidable) subset, or subspecies, since this notion 
of a partially defined function presents difficulties. 

Towards writing our new schema in I, we first express the density of the 
species A(x) in an arbitrary spread o ([5] p. 56) by means of the following 
formula: 


Dense(— A(a), 0): Vegeq VXB peo(B(x) == &(x) & — A(f)). 
For the case o is the universal spread v, we have then 


+ Dense(— A(a), 0) ~ VZgeq (2) 3B(B(UA(z)) = z & — A(B)). 


We now state our axiom schema (1) as a postulate for v rather than for 
the continuum of real numbers. (In (1) and (3) f is not free in A, and in (2) 
and (4) b is not free in A.) 


(1). Dense(— A(a), v) & Va(— A(a) > IBB(a, B)) 
> Vai B(— A(a) > B(a, B)). 


Actually, to obtain the Brouwer refutations we need postulate only the 
consequence for number-valued functions: 


(2). Dense(— A(a), v) & Va(— A(a) > JbB(a, b)) 
> Vaib(— A(a) > B(a, b)). 


Derivation of (2) from (1) in I is straightforward. 
As consequences in I of (1) and (2) respectively, we can now obtain the 
corresponding statements for arbitrary spreads. 


(3). Spr(a) & Dense(— A(a), 6) & Voree(—a A(a) > ABB(a, B)) 
= Al A(a) = B(a, B)). 


(4). Spr(c) & Dense(— A(a), 6) & Voqeq(—a A(x) > ADB(a, b)) 
> Voqeqdb(— A(x) > B(a, b)). 
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2.2. Let I° be I with (1) as added axiom schema. We shall show I° is con- 
sistent by showing its theorems are ,realizable ([5] 10-11). 


THEOREM 1. Let A, B(f) be formulas of 1 of orders a and b respectively 
((5] 10.5), A not containing B free. Then the following formula is ,realizable. 


(oe (8) ae ee ea ()) 
where c is a * 1, etc. 


Proor. Let ¥ be a list of variables including all those except f free in 
A, B(B). For any ¥ the formula (5) is ,realized-¥ by 2A °e e[*e] where 
gfe] = <(*{E}[O)o, 44°(“{%e}[70]),. 

For, suppose (i) “e ,realizes-'¥ the hypothesis. We must infer that [’e] 
stealizes-¥ the conclusion. We need that {A‘a(“{‘e}[0]), ,realizes-¥ 
{(*{°e}[°0])o} “= A’ > B(B). Suppose ‘a ,realizes-Y¥°— A. Then (ii) ‘0 
stealizes-¥ °— A. We need that (“{’e}[°0]), ,realizes-¥ {(“{°e}[0])o} B(A). 
This follows from (i) and (ii). 


Coro.uary 1. If [+ E in Y° and the formulas T are ,realizable, then E is 
realizable. 


CorROLLary 2. In I° not + KS. 


2,3. 

THEOREM 2. Let S be the formal system obtained by adding KS as axiom 
schema to Kleene’s basic system B ({5] p. 8), or even to B less the bar theorem 
§26:3. Then in sia): 


Proor. Assume the hypotheses (i) and (ii) of (1). By “2.1 applied to (i), 
assume 


(ili). V2seq)(B_(Ih(z)) = z & — A(B,)), 
where B, = AxB(<z, x>). By KS 
dy{[Vxy(x) = 0 ~ — A(a)] & [5x7(x) # 0 > A(@)]}. 
Assume 
(iv). [Vxy(x) = 0 ~ — A(a)] & [3xy(x) 4 0 > A(@)]. 
Now define ¢€ 


_ fa(x) if Vz,<xy(z) = 0, 
Co e25 3) — \Baoy(x) if 3z,<.7(z) # Oand y = yz,<x9(2) # 0. 
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Then (vi) — A(¢), for assume for reductio ad absurdum (a) A(¢). We shall 
deduce — — Axy(x) #0 and —4xy(x) #0. For the first, assume 
— Axy(x) 4 0. Then Vxy(x) = 0. So — A(a) by (iv) and also € = « by (v). 
So — A(¢), contradicting (a). For the second, assume Ixy(x) 4 0. Assume 
y = 2, <xy(z) #0. Then € = Bz) by (v) and by (iii) — A(¢), again contra- 
dicting (a). Hence, (vi). So by (ii) and (vi) assume B({, ¢). Now toward 
deducing — A(a) > B(a,c), assume — A(a). Then Vx>(x) = 0 [(iv)]; 
so € =a. So Bia, a). By >, V-introd. Vado(— A(a) > B(a, c)). 


2.4. We shall not derive in I° (or I plus (2)) each of Brouwer’s refutations, 
but only some representative ones. Though we have not written out every 
detail of the other derivations that would be required, we believe, on the 
basis of extensive checking not reproduced here, that in I plus (2) we can 
obtain all the counterexamples given or cited in [2] 8.1. 

In theorem 3 the formula in (a) is the negation of ‘Markov’s principle’ 
(extended to functions, cf. Kleene’s remark 18.6 in [5] p. 185); (b) and (c) 
correspond to Brouwer’s most often given counterexamples ([2] pp. 
117-118); (d) is a lemma used to establish (e) (a description of our proof of 
(d) has already been given by Hull [3]); (e) establishes that the virtual 
ordering < of the continuum is not a pseudo-ordering ({2] p. 117 lines 
7-8, [3]); (f) corresponds to Heyting’s theorem 2 ((2] p. 118) with 1 
replacing 0. This last is the only refutation treated here which appears not 
to be derivable from (a) as axiom replacing (1) (or (2)). Another in the 
same class is our formal version of the negation of the Bolzano-Weierstrass 
theorem ([2] p. 119). 


THEOREM 3. In I° (or in I plus (2)): 
(a).  — Va(— Vxa(x) = 0 > Ixa(x) # 0). 
(b). | 4 Vagea(—% & 0 > a + 0). 
(c). | 4 Voper(a > 0 > a > 0). 
(Oy Vek = yVy =e) > aS 0. 
(f). § 4 Va,r(e2loa< iva > 1). 
Proor. (a). We easily deduce Dense(— Vxa(x) = 0,0). Next assume 
Va(— Vxa(x) = 0 > dxa(x) # 0). By (2), Vadx(— Vxa(x) = 0 > a(x) # 0). 
Using *27.2, assume 


(i). Vady{r(&(y)) > 0 & Vx(t(&(x)) > 0 > x = y) & (A Vxa(x) = 0 
> a(2(@(y)) + 1) # 0)}. 


202 R. E. VESLEY B XII 


Letting « = Ax0, we can assume 
(ii). t(Ax0(y,)) > 0. 
Define «,: 


0 if x < c(Ax0(y,)) vx <y,, 
1 otherwise. 


(i) ce (0) 
Then (iv) «,(y,) = 4x0(y,). Putting «, in (i), we can assume 


(v). t(a1(y2)) > 0, 
(vi). Vx(t(%(x)) > 0 > x = y2) 
(vii). —Vxe,(x) = 0 > a,(t(a,(y2)) + 1) 4 0. 


From (ii), (iv), (vi) ¥1 = ¥2- Now 04 a, (t(a,(y2)) + 1) (by (vii), ili)) 
= o4(t(a1(¥1)) = 1) = o4(tx0(,)) = 1) (by (iv)) = 0 [iit)]. 

(b), (c). Cf. Kleene’s remark 18.6 [5] p. 185. 

(d). Assume (i) «€ R and (ii) Vy,er(0 < yvy < a). By (i) and *R1.11 
((5] p. 138), assume (iii) @’ € R’ & «’ = a. Using (ii) and *RO.7 with (iii) 
and *R7.3, we deduce Vy,<x(0 < yvy <a’). Thence from *RO0.8 and 
*27.6, assume 


(iv). Vyew {VL t9(x)) > 0D y = x] & 
{(0 < y & t(y)) = 1) v(y < & & tH(y)) = 23}. 
Putting y & «’, assume 
(v). ¥x[t(a’(x)) > O0>y =x] & 
{0 <a & (@"(y)) = Iv @ Xo! & e(%(y) = 2}. 


Using *R7.6, (vi) t(a’(y)) = 1. Now (vii) «’(y + 1) + 1 > 0, as follows. 
Assume (1) a’(y+1)+1 = 0. Using *R9.20 assume (2) Pe R’ & B(y) = 
x(y) and (3) B &(w(y+1)=1)+2°°"9, By (3) and (1), (4) 
p =0-2-°°) 2 0 [using *R9.4]. From (iv) with f for y, (vi) and (2), 
0 < £, contradicting (4). 

Now @ € (a’(y+1)+1)°2°°° [*R9.16] > 0-27") 20. Then by 
(iii), *R7.3, « > 0. 

(f). We shall show contradictory: (i) Va,p(A«£loa<ilva > 1), 
whence (f) [using *RO.7]. From *RO.8 assume (ii) Spr(c) & Va(a eR’ ~ 
aea). We deduce (iii) Dense(— « & 1,c¢), as follows. Assuming a€o, 
define B 
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a(x) Ne SS < y. 
, ell ay cea eo 
CRO a(x) Sy = eee Qn es oe, 


2B(x~+1) if x > y. 


Then (v) B(y) = &(y), and by cases (x+1 <y, xt+1=y, x41 > y; 
with subcases (a(x+1) = 2**?, a(x+1) 4 2***) in the second case): 
[2B(x)—B(x+1)| < 1. Thence (vi) Be R’. By induction on p, using (iv) 
|P(y+p)—2"*?| > 2?, whence 2’|B(y+p)—2’t?| > 2”*? and 


— AxVp 2?|B(x+p)—2**?| < 2**?; 


so (vii) — B & Ax2* 2 1. Combining (v)-(vii): (iii). From (i) and (ii): 
(Vili) Voce &@ & 1 > Ix[(x =O &a < 1)v(x=1&a>1))). Using 
(iii) and (viii) in (2) 


lle 


Wore, x(x — 0 & o =< NV (ie = 1 & o > 1)). 


Using *27.5, assume 


(ix). Vogee dy {t(Z(y)) > 0 & Vx(t(K(X)) > OD x = y)& 
[ee aleea( nial) al Goo — 1) (c(e(y)) = 2.0 = 1/))}. 


Putting « = 1x2*(2 1), let w, = t(Ax2*(y,)) > 0. By cases (w, = 1, 
Ww, > 1) with (ix) we can easily deduce a contradiction. 


In the proof in [1] of the result (c) Brouwer seems to assert the continuity 
of a number-valued function defined on a particular negative dense proper 
subspecies of a certain spread, namely on the real number generators a with 
a > 0. See particularly p. 123 lines 8-10 from below. This is the closest 
we can come to direct evidence for (1) in Brouwer's texts, and it is very 
weak. For his use of what might be taken as a consequence of (1) appears 
unnecessary for his argument, which he bases instead on use of the creating 
subject. 


3. Systems contradicting (5) or (1) 
3.1. To derive the Brouwer refutations we could start not from (2) but 
from the stronger schema 


(6). (4 A > 4xB(x)) > 5x(— A > B(x)), 


which is itself a consequence of (5). We do not do so, not only because 
we can find no derivation of (6) from KS and no evidence for the intuition- 
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istic truth of (6), but also because we have evidence against (6), namely 
its inconsistency with the ‘strong Kripke schema’ [7] 
KS 5B(AxB(x) # 0 ~ A). 

Let S’”’ be the system obtained by adding KS* to I*. We sketch the proof 
that (6) is contradictory in S”. By KS* 3f(4xB(x) # 0 ~ — Vxa(x) = 0). 
Call this 18K(«, 8). Assume K(a, 6). Now 

K(a, B) & (— Vxa(x) = 0 > AxB(x) # 0). 
Using (6) etc. AfAx(K(a, B) & (— Vxa(x) = 0 > B(x) 4 0)), whence 
VadxdB(K(a, B) & (— Vxa(x) = 0 > B(x) # 0)). Using continuity (*27.2) 
and then putting « = 1x0, we can deduce (formulas from which we may 
assume) 


Vetzeq3B(K (ss B) & (— Vxa(x) = 0 > flv) # 0)) 
for 6,),,7 satisfying Spr(o) & Va(aeo0 ~ Vx,<,,a(x) =0) and y. = 
t(Ax0(y,))+1. Thence 


Veteo JUAB(B(y2) = u & K(a, B) & (— Vxa(x) = 0 > Bly2) # 9) 
write Vez, JuABB(a, B, u, y2). Then assume by continuity 


Vozes (al yt (Hy) > 0 & Vy(t'(&(y)) > 0 > SBB(a, B, t'(&y))+1, y2))}- 


Putting « = Ax0, assume 1’(&(y3)) > 0 & Vx(t'(&(x)) > 0 > x = y3) and 
B(AxO, B, t'(Ax0(y3)) +1, Ya). If B(y.) #0 then by K(Ax0, 8) we have 
— VxAx0(x) = 0; so c’(Ax0(,))= 1 = 38093) = 0. Hence 


VatzeolVXx<y,4(x) = 0 > ABB(a, B, 0, y2)], 
whence we can deduce 
VOpee[VXx<y,0(x) = 0 >  — Vxa(x) = 0], 
which is easily shown contradictory. 


3.2. A way of phrasing (1) is to say that in the assertion that « is in the 
negative dense species A«A(«) there is conveyed no information necessary 
to determine a choice sequence f corresponding (via the relation B) to a. 
All such information is already given in «. On one view even this weakening 
of (5) is implausible; doesn’t the assertion A(q) tell us at least of the existence 
of a certain derivation from A(«) of a contradiction? One may prefer a 
formal theory in which (1) fails. We shall show that in Troelstra’s theory 
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CS of [9] we can derive a contradiction from the conjunction of (1) and the 
following modification of (1) in which the f of (1) is replaced by a variable 
e ranging over K. 


(7). Dense(—A(a), v) & Va(—A(a) > JeB(a, e)) > Vade(— A(x) > B(a, e)). 
Using axiom schema F2 of CS 
Va(— Vxa(x) = 0 > Je(Ap a = elf & VB — VxelB(x) = 0)). 
Using (7) 
Vade(— Vxa(x) = 0 > SP(a = elB & VB — VxelB(x) = 0)). 


Applying the CS continuity schema F3, putting « = At0, and then using 
(3), we may assume 


VOgc0 IB(— Vxe(x) = 0 > a =f|B & VB — Vxf|B(x) = 0) 


for o satisfying Spr(c) & Va(aeo ~ Vx, <y,a(x) = 0) for a certain y,. 
Then by the Vaif-continuity schema F4 assume 

(i). Vozeq( a Wx0(x) = 0 > a = fl(gla) & VB — Vxf1B(x) = 0). 
Thence — Vxf|(g|At0)(x) = 0. But also we can deduce Vxf|(g|At0)(x) = 0 
as follows. Assume f|(g|At0)(x) # 0. Thence assume f(% * (g|A10)(y)) > 1. 
We can deduce formulas from which we may assume 


(ii). Vo(Vx,<y,«(x) = 0 > f(% *(gla)(y)) > 1. 


Then define a, 

Ber 0 if u < max(x, y;, y2), 

pe) ts otherwise. 
From (i) 2, =f\(gla:). But c,(x) = 0 [(iii)] 4 fI(gles)(x) [Gii), (i). 

We do not know if (1) alone is contradictory in CS, nor do we know the 
status of the Brouwer refutations with respect to CS. 

(We should like to thank Joan R. Moschovakis for calling to our attention 
an error in our original discussion and for pointing out that our argument 
does not contradict (1) but only the conjunction of (1) and (7).) 


4. A metatheoretical application of (2). Suppose one accepts the principles 
of I together with (2) as defining one kind of intuitionistic theory. Using 
this theory informally as metatheory what results concerning I or I° follow? 
We give in theorem 4 one result: there is a model for I in which the function 
variables range over the free choice sequences which are not not general 
recursive. Classically the theorem fails (by Kleene’s corollary 9.9 [5] p. 
113); its ‘intuitionistic proof’ here (i.e. proof in the informal version of I°) 
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requires not only (2) but (the classically invalid) Brouwer’s principle for 
numbers. 

Presumably there is a parallel to this theorem in which I° replaces I and 
C/,realizable replaces C/realizable. I have not checked details of this. 

For the theorem let C be the intuitionistic species of all choice sequences 
a satisfying GR(«) where GR(«) = « is general recursive. The theorem is 
proved under the hypothesis that the proof of Kleene’s theorem 9.7 of [5] 
p-111 can be carried out in the informal version of I°. This hypothesis (or 
even that the proof could be carried out in B) seems extremely plausible 
but we make no attempt to justify or further elucidate it. Its verification 
may emerge from Kleene’s recent work [4]. 

THEOREM 4. Using informally the principles of V° (and specifically *27.2 
and (2)): If + E in I then E is C/realizable. 

Proor. We apply Kleene’s theorem 9.7 cited above. Additionally we need 
to consider the case of axiom schema *26,3° (the bar theorem). We adapt 
Kleene’s treatment of that case in his proof of theorem 9.3 [5] p. 107, 
now using Brouwer’s principle and (2) to show the well-foundedness of the 
tree for the informal bar induction. Specifically, we take over as (1’}-(4’) 
the statements (1)-(4) of p. 107, with the modification that ¥, 7, , po, 1,0 
must be in C. Next, since we have by hypothesis only that {(7)o, o}[«](c) 
is defined for each « in C, and not for arbitrary «, we need to modify the 
argument at the top of p. 108. Let 


R'(x, a) = (Ea)[GR(x) & a = a(x) for x = ({(m)o,o}%KO))ol. 
We shall show (5’) (8)(Ex)R’(z, B(x)). By hypothesis, (7)o,9 C/realizes-¥ 


VBAxR(B(x)); thence (B)[GR(B) > {(7)o, o | [8 ](0) is defined]. So (8) [GR(B) > 
(E!u)(7)o, (2 *B(u)) > 0]. By (2), and the density in our sense of the not 
not-general recursive choice sequences in the species of all choice sequences 


(B)(Eu)[GR(B) — (n)o, (2 * B(u)) > 0 & (0)[(n)o, o(2* BQO) > 0 > u = oJ], 
Abbreviate this (f)(Eu) [GR(B) —» B(x, B, u)]. By Brouwer’s principle assume 


(i). (B(Ey){x(BO)) > 0 & (z)(c(B(z)) > 0 > z = y) 
‘ & [GR(B) > B(x, B, (B(y))=1)]}- 
(ii). c(BQY)) > 0 & (2)(c(B(z)) > Oz = y). 
& [GR(B) > B(x, B, ~(B(y))= 1]. 
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Let x = (()o, o(2* B(c(B(y))=1))= Do. Define a, 
GO he if t < max [x, t(B(y))=1, y], 


0 otherwise. 


By (i) there is y, such that 
(iv). e(@%ly,)) > 0 & (2)le(@(2)) > O42 = y,) : 
& [GR(«,) > B(x, «,, t(a,(y,))— 1}. 
Now GR(a,); so (v) B(x, «,, t(%,(y,))+1). Then 
(vi). {(mo, of [%11) = (#)o, o(2*a1(t(a1(y1)) +1) = 1. 


Also t(@,(y)) = a(B(y)) [(ii)] > 0 [(ii)]. By (iv) y = y,. So t(a,(91)) = 
a(B(y)), and (vii) «(t(@1(91))+1) = B(c(B(y))~1) [(iii)]. So from (vi), 
(vii) ({(*)o, 0} [41]O)))o = (*)o, o2*B((B(Y))=1))= 10 = x. From (ii) 
iy = 0,(*). So GR(a,) & B(x) = a, (x) & x = ({(a)o, 0} [#1 ](0))o, whence 
a): 

Now the definition of R,, the derivation of (6') (with R’ replacing R) 
and the remainder of the proof can proceed as before. 
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SECTION C 


CLASSICAL INTERPRETATIONS 
OF INTUITIONISM 


A CHARACTERIZATION OF THE 
INTUITIONISTIC PROPOSITIONAL CALCULUS* 


D. H. J. DE JONGH 


We will present here a characterization of the intuitionistic propositional 
calculus Pp from above, i.e. we will describe a property of Pp that no 
consistent propositional calculus stronger than Pp possesses. By a proposi- 
tional calculus stronger than Pp (at least as strong as Pp) we understand 
one in which all formulas in Pp are provable and some others as well (and 
possibly some others as well), and which is closed under substitution and 
modus ponens. (Closure under substitution is, of course, guaranteed if no 
particular axioms are postulated, but only axiom schemata.) By a formula 
we understand a formula built up from Y,,..., A, (the atomic formulas) 
with the connectives &, Vv, > and —. We will follow the notation of Kleene 
ie. 

Lukasiewicz [11] proposed the conjecture that Pp can be characterized 
from above by the property: 


for any formulas 2, 8, if tpp Uv B, then pp W or tpp B. 


This conjecture was disproved by Kreisel and Putnam [9], who showed 
that Pp+ the axiom schema (— &% > Bv€) > (A A> BV)v(HAABaC) 
has the same property. 

In [8] Kleene proved a stronger property of Pp and he subsequently 
proposed to the author the conjecture that this property characterizes Pp 
from above. First, one defines a notion [|r for any sequence I of formulas, 
any formula Y% and any propositional calculus T, from the notion fr of 
provability in T. Kleene states the definition in [8] in particular for the case 
that T is Pp (cf. [8] § 4), and he proves (among other things) that, for each 
W, B,C, if Wp, 2 and tp, 2 > BvG, then tp, MW > GB or tppA a. 


Kleene’s conjecture, which we will confirm in this presentation is: if T is a 


* Part of the author’s typewritten doctoral dissertation. 
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propositional calculus at least as strong as Pp, possessing the property 
if Wr Wand ty YW > BvG, then kz W > Bor tr A > ©, (G) 


then T is Pp. Also we will give another characterization of Pp from above 
by replacing (*) by 


if Ue W and te (A > B) & (B > W), then Bix B. (**) 


For this purpose we discuss some connections between pseudo-Boolean 
algebras and J-valuations (see for the definition of J-valuations Beth [1], 
Kripke [10], de Jongh [4]). If 4(F,,...,A,) is a formula we will write 
YW*(a,,...,%,) for the pseudo-Boolean algebraic term formed from 
@1,-..,%, with Apes = andgesin the same audy ase?! frome?) ee 
with & v, > and —. A formula Y is said to be valid in a pseudo-Boolean 
algebra A, iff U%(q@, pee, ) el forcall'a prec ede 


THEOREM | (McKinsey and Tarski [12]). 
(a). kpp 2 iff YU is valid in every pseudo-Boolean algebra. 
(b). frp WU iff U is valid in every finite pseudo-Boolean algebra. 


DEFINITION |. For any propositional calculus T at least as strong as Pp 
we say that a pseudo-Boolean algebra A is a T-pseudo-Boolean algebra 
iff for each formula YF such that fy Wf, is valid in A. 


THEOREM 2. For every propositional calculus T at least as strong as Pp, 
by Q iff YU is valid in every T-pseudo-Boolean algebra. 

Proor. Immediate from results of Birkhoff [3] on equationally defined 
classes of algebras and the fact that T-pseudo-Boolean algebras can be 
defined by a system of equations, since pseudo-Boolean algebras can 
(cf. e.g. [14]). 


Another special case of a theorem of Birkhoff [3] is 


THEOREM 3. The class of all T-pseudo-Boolean algebras is closed under the 
formation of sub-algebras, homomorphisms and direct products. 


The following definitions are from [6] (mostly originally from [13]). 


DEFINITION 2, If a partially ordered set (V, <> is a complete lattice, then 
ae V is called join-irreducible iff « > \) {B: B < a}. The set of all join- 
irreducible elements of V will be denoted by V’°. 
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DEFINITION 3. A lattice V is called join-representable iff V is complete and 
completely distributive, and every « € V can be written as « = ()} {B: B < a 
and Be V°}, 
DEFINITION 4. A subset F of <V, <> is called M-closed iff for all p, ge V, 
peéFandg < pimplyqeF. 

The set of all 44/-closed subsets of a partially ordered set V will be denoted 
by V. Vis then complete and completely distributive. 


THEOREM 4, ([13], [6]). Every join-representable lattice V is isomorphic to We, 


THEOREM 5. (e.g. [2]). A complete and completely distributive lattice is a 
pseudo-Boolean algebra if we define « => B = \) {y:any < Bh. 


Every finite distributive lattice is complete, completely distributive, and 
join-representable (e.g. [2]). So theorem 5 implies that every finite distribu- 
tive lattice is a pseudo-Boolean algebra V for some partially ordered set V. 
Since for every partially ordered set V, V is a distributive lattice, there is 
a 1-1 correspondence between finite pseudo-Boolean algebras and finite 
partially ordered sets. 


DEFINITION 5. If V is a partially ordered set, then V is T-admissible iff V 
is a T-pseudo-Boolean algebra. 
DEFINITION 6. A P.O.G.-set is a partially ordered set with a greatest element. 


THEOREM 6. (essentially in [6]). If V is a P.O.G.-set, then there is the 
following correspondence between any I-valuation <V,w> and the pseudo- 
Boolean algebra V: 
for all formulas U, 8, if F = {pe V: w(p, X) = 1} andG = {p: w(p, 8) 
= |}, then 
Gag G an wipe se) 1 t 
Ce OG = ipo wep, lve) = 1, 
iid een) ee) ait 
(iv) = po w(p, — 2) = I}. 


THEOREM 7. If V is a P.O.G.-set with greatest element m, then V is T-admissible 
iff for all I-valuations <V, w) and all formulas X such that ty UW, wim, W) = 1. 


We will need a short resume of some results of [5]. 


DEFINITION 7.(a). An J-function is a function with domain a finite P.O.G.-set 
V and range the set {0, 1} with the property that, ifg < pin Vand/f(p) = 1, 
then (¢ "1. 
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(b). An J"-function f is a function with domain a P.O.G.-set V and range 
the set {0, 1}” such that for all m (1 < m < n) the function f” defined on 
V by f"(p) = (f(p))(m) is an Ffunction. 

We write F” for the set of all J°-functions and F for the set of all 
functions, m, for the greatest element of D,, the domain of f, and <, 
for the partial ordering of the domain of f. 

DEFINITION 8.(a). f, g € F” are congruent by ¢ iff g is an isomorphism from 
D, onto D, such that f(p) = g(y(p)) for all pe D,. 

(b). f is congruent to g (in symbols f = g) iff f is congruent to g by @ 
for some @. 

DEFINITION 9. An n-ary J-operator a is a function from F” into F with the 
properties: 

(i). Dacyy = D, for all fe F". 

(ii). iff = g by 9, then a(f) = a(g) by g. 

If V is a partially ordered set and pe V, then we write V(p) for the set 
{p'eV:p' < p}. If fe F” and pe D,, we write f, for the restriction of f 
to D,(p), g < fiff Ap)\(g =f). 

DEFINITION 10. A function g fiom the partially ordered set <V, <)> onto the 
partially ordered set (W, <> is strongly isotone iff 

(i). for all p’, pe V, if p’ < p, then g(p') <, (Pp) 
and 

(ii). for all p’, pe V, if p(p’) <1 v(p), then for some p” < p, g(p") = 
= 9(7’). 

DEFINITION 11.(a). If f, ge F", then g is a reduced form of f iff there is a 
strongly isotone function @ from D, onto D, such that, for all pe D,, 
Io(P)) =f(P). 

(b). fis irreducible iff all reduced forms of f are congruent to f. 

(c). If g is a reduced form of f and g is irreducible, then we call g a 
normal form of f. 


In [5] we proved that the normal form of an J"-function is unique up to 
congruence (theorem 2.3). Also we proved that, if f is irreducible and 
g <f, then g is irreducible (lemma 2.2 Cor.). 


DEFINITION 12. A normal J-operator a is an J-operator such that 


(i). for all fe F" and pe Dy, (a(fp))(P) = (a(F))(P) 


and 
(ii). iff, géF" and g is a reduced form of f, then (a(g))(m,)=(a(/))(m,). 
DEFINITION 13. The normalized characteristic set C* of a normal n-ary 
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I-operator a is the set of all irreducible /"-functions f such that 
(a(f))(m;) = 1. 


In [5] it was proved that a normal J/-operator is uniquely characterized by 
its normalized characteristic set. 


DEFINITION 14, A normal J-operator is finite iff its normalized characteristic 
set contains only a finite number of congruence classes. 


In [5] we defined then /-operators corresponding to the usual connectives 
in a natural way. These /-operators were proved to be normal (and in the 
cases of & and — finite) and we proved that, if we introduce definability 
in a natural way, then all finite normal /-operators are definable from the 
L-operators corresponding to the usual connectives (theorem 3.7). More in 
particular: 


THEOREM 8. Let geF", g irreducible, m, has k immediate predecessors 
G1>-++> 9% with respect to <,, for each i (\ < i< k) q; has k, immediate 
predecessors with respect to <,, and for alli andj (\ <i< k,l1<j<k;) 
a; and a;, are respectively the normal I-operators with normalized character- 
ioc, — fe lyf <9.) vand C= 4 felis < g,,\. Then, 
in the case that for all m (1 < m<n)f"™(p) = 0 for some p < m,, the 
normal I-operator a with normalized characteristic set Cf = {fe F":f < g} 
can be expressed as follows: 


ki 
(with o a;, standing for a; if k; = 0). 
j= 


The main theorem is a little bit stronger than we need to establish the 
results predicted earlier. We have not checked the intuitionistic validity of it. 
Probably the double negation will hold intuitionistically. 


THEOREM 9. If T is a consistent propositional calculus stronger than Pp, 
then for each integer r > 2 there is a formula U> B,v...v¥B, (s 2 r) 
such that U|rU and ty UW > B,v...vB,, but not try U> Biv...v Bi, 
for any proper subsequence (i,,...,i,) (k > 1) of (1,..., 5). 

OUTLINE OF PROOF. First we construct a finite P.O.G.-set <W, <9, 90> 
having q,,.-., 4, as the immediate predecessors of ¢y (k > 1), such that 
W is not T-admissible, but, for all i (1 < i < k), W(q;) is T-admissible. 
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Then from this P.O.G.-set W we construct a P.O.G.-set <V, <, po> with 
P1,+-+5DP, (s 2 1r) as immediate predecessors of po such that V is not 
T-admissible, but for all j (1 <j <-s), V(p,) is. Then we construct an 
irreducible J"-function g on V for some n such that, for all m (1 < m < n), 
g'(p,) = Ofor some 1 (1 <5), 

Now assume 4, a;,..., a, are the definable /-operators with normalized 
characteristic sets Ci = {feF":f<g} and Ci, = {feF":f< g,,} 
(1 <i<_s), and assume that WU, B,,..., 8, are the formulas corresponding 
to these definable J-operators. We will show 

(a)> Fpl BV oe 

(b). not kr WU > B,v...v%,, for any proper subsequence (i,,..., ix) 
ob (nae): 

(c). Wr 2. 

(a). The crucial point of the proof is that the class of T-pseudo-Boolean 
algebras does not contain a pseudo-Boolean algebra on which 
W> B,v...v B, is not valid (a ‘counter-example’ to this formula). More 
precisely, for any pseudo-Boolean algebra A on which 2 > B,v...v%, 
is not valid, the pseudo-Boolean algebra V is isomorphic to a sub-algebra 
of a homomorphism of A, and so theorem 3 implies that, since V is 
not a T-pseudo-Boolean algebra, A cannot be one, and therefore 
Fr tl > Ov ev .. In eliect, if ope, Oo, Ceteabesnchuihat 


ee eMC OOM imag, (eo) eo 1h. 


then V can be proved to be isomorphic to the sub-algebra of the relativization 
of A with respect to U*(a,,...,a,) generated by the images of a,,..., % 
under the natural homomorphism on that relativization. 

(b). If té(i,,...,i,) (O<t<s), then we can prove that I>, v...v B,, 
is not valid on V(p,); then, since V(p,) is a T-pseudo-Boolean algebra, 
not Pr Wl > Dee Oe 

(c). This is easy to prove with the help of theorem 9 and the definition 
of lr. 


n 


For our second characterization we give first an equivalent expression for 


Wr Ww. 


DEFINITION 15. a is a connected I-operator iff a is normal and for all f, g € C; 
there exists an A such that f < handg <h. 


Of the next theorem we again did not check the intuitionistic validity. 
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THEOREM 10. For any formula U, Up, WU iff the I-operator corresponding to 
Y is connected. 


From this theorem and the proof of theorem 9 theorem 11 follows almost 
immediately. 


THEOREM I1. If T is a consistent propositional calculus at least as strong as 
Pp, and 


for each UX, B if Ue U and tz (A > B) & (B > A), then Blr B, (**) 


then all theorems of T are provable in Pp. 
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INTUITIONISTIC MODEL THEORY AND THE COHEN 
INDEPENDENCE PROOFS 


MELVIN FITTING 


Gédel proved the continuum hypothesis consistent with the other axioms 
of set theory [2] by constructing a transfinite sequence of (domains of) 
classical logic models {M,}, taking a limit L, over all ordinals, and showing 
it was a model for set theory and the continuum hypothesis (among other 
things). We will indicate how this procedure can be generalized to transfinite 
sequences of Saul Kripke’s intuitionistic logic models [7] in such a way as 
to establish the independence results of Cohen [1]. 

This sort of thing has been done by Vopénka and others (see refs. [3]-[5] 
and [10]-[14]) using topological intuitionistic models. Kripke’s model 
structure is closer in form to Cohen’s forcing technique, and the methods 
used are more ‘logical’. Neither Vopénka’s nor this method requires count- 
able models for set theory. 

First I will briefly sketch Kripke’s notion of an intuitionistic . 
model, since the notation I use is different from his. 

Notation. If P is a function ranging over sets of parameters, by P(I) 
we mean the collection of all first order formulas with constants from P(I). 
% is any atomic formula, X and Y) are any formulas. 


DEFINITION |. By an intuitionistic model we mean an ordered quadruple 
<G, R, F, P>, where G is a non-empty set, R is a transitive, reflexive rela- 
tion on G, F is a relation between elements of G and formulas, and P is a 
map from G to non-empty sets of parameters, satisfying for any , 4 € G 

Goer RA = PUT) P(A); 

(2). PEW => We PL); 

Geer 2, FRA SA Po; 

(4). FE(KAY) ere xX andre Y; 

(5). FE(EVY) <> (EVM) Ee P(r) andr kt X ork Y, 

(6). Fk ~ ¥<> ~ XeP(l)and for every 4¢G such that PRA, A ¥ X; 
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(7). FE(X > Y)+(X > Y)eP(L) and for every 4¢G such that 
TRA, if AEX, AF Y; 

(8). FF (Ax)X(x) <> for some ae P(I'), FF X(a); 

(9). FF (Wx)€(x) <> for every A eG such that PRA, and for every ae P(A), 
AF X(a). 


DEFINITION 2. & is valid in the model <G, R, F, P> if for every [ € G such 
that Xe P(r), TEx. 
& is valid if X is valid in every model. 


THEOREM 1. (Kripke [7]). & is a theorem of intuitionistic logic if and only 
if & ts valid. 


The above modeling may be briefly motivated as follows: 

G is a collection of possible states of knowledge; any [eG may be 
considered to be a collection of physical facts. FRA means if now we know 
I, later we might know A. P(I) is the set of constants constructed by the 
stage I’, or the set of parameters introduced in reaching I. Finally [FX 
means that from the facts Fr we may deduce &. 

If a model <G, R, F, P> has a countable domain, i.e. |) {P(I)|II € G}, we 
may apply Cohen’s complete sequence method [1]. Call H | G an R-chain 
if any two elements are R-comparable. Call Ha complete R-chain if forany 
formula X, only using parameters ‘available’ in H, for some le H, PFXv ~%. 
Then, as in [1], any I € G can be included in a complete R-chain. If H is a 
complete R-chain, {X| for some I € H, I F X} is, if we ignore the universal 
quantifier, a classical truth set. Now suppose & has no universal quantifiers 
and ~~ is not an intuitionistic theorem. The analog of the Skolem- 
Léwenheim theorem holds for Kripke’s models, so for some model with 
a countable domain <(G, R, F, P> for some T €G, I ¥ ~~. For some 
AeéG, PRA and A F~%. By the above remarks ~X must belong to some 
classical truth set, so X is not classically valid. (This can be extended to a 
full proof of Kleene [6] theorem 59). 

Suppose we could find some intuitionistic model <G, R, F, P> in which 
for some f€ G, FF ZF and I ¥ ~~AC, where ZF is the set of Zermelo- 
Fraenkel axioms, and AC is the axiom of choice, all expressed in (classi- 
cally equivalent) forms not using the universal quantifier. Then by the above 
we would have the classical independence of the axiom of choice. (Note 
that -(X¥ > ~~Y) = ~~(X > Y).) Before showing how this may be 
done, we present the Gédel construction in order to bring out the analogy. 
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Let V be a classical Zermelo-Fraenkel model. In [2] Gédel defined over 
V the sequence {M,} of sets as follows: 

Me er, 

M,+, 1s the collection of all definable subsets of M,, 

M, = U2<aM, for limit ordinals 2. 

Let L be the class | ),.y M,. Godel showed that L was a classical ZF model. 

As an introduction to the intuitionistic generalization, we restate the Gédel 
construction using characteristic functions instead of sets. Now of course 
‘e’ is to be considered as a formal symbol, not as set membership. 

Let M be some collection and let v be a truth function on the set of for- 
mulas with constants from M. We say a (characteristic) function / 1s defin- 
able over <M, v> if domain(f) = M, range(f) S {T, F}, and for some for- 
mula X(x) with one free variable and all constants from M, for all ae M 


F(a) = v(X(a)). 


Let M’ be the elements of M together with all functions definable over 
«M, v>. 

We define a truth function v’ on the set of formulas with constants from 
M’ by defining it for atomic formulas. If f, g € M’ we have three cases: 

(1). figeM. Let v'(feg) = v(feg); 

yey = Nagel — iV, Let v( jeg) =); 

(3). fe M’—M. Let X(x) be the formula which defines f over <M, v). 
If there is an he M such that v((Vx)(x eh = X(x))) =T and 
v'(heg) =T let v'(feg) = T, otherwise let v'(feg) = F. (Case 3 
reduces the situation to case | or 2.) 

We call the pair ¢M’, v'> the derived model of <M, v). 

Now, let My = @ and let v9 be the obvious truth function. Thus we 

have (Mo, t9>. 

Let (M,41> +1) be the derived model of (M,, v,>. 

If A is a limit ordinal, let M, = (),<,M,. Let v,(fe g) = T if for some 
a <d,v,(feg) = T, otherwise let v,(fe g) = F. Thus we have ¢(M,, v,). 

Let L = |), .yM, and let (fe g) = T if for some we V v,(fe g) = T, 
otherwise let v(fe g) = F. Thus we have the ‘class’ model <L, v>. All the 
axioms of ZF will be valid in this model. 

Before proceeding to the intuitionistic generalization, we note that it can 
be shown that for formulas without universal quantifiers it suffices to con- 
sider only intuitionistic models with the P-map constant. From now on we 
will assume this, and we will write the range of the map instead of the map 
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itself. Thus our models now are quadruples ¢(G, R, F, S>, where S is a collec- 
tion of parameters, etc. Also, for convenience, let B be the collection of all 
R-closed subsets of G. 

Let <G, R, -, S> be some model. We say a function f is definable over 
<G, R, F, S> if domain(/) = S, range(f) © B, and for some formula X(x) 
with one free variable, all constants from S and no universal quantifiers, 
for anyae S 


F(a) = {FP F X(a)}. 


Let S’ be the elements of S together with all functions definable over <G, R, 
F, S>. 
We define a F’ relation by giving it for atomic formulas over S’. If f, ge S’ 
we have three cases: 
(1). £geS. Let FF’'(fe g) if FF(fe g). 
(2). fe S, ge S’—S. Let re’(fe g) if Fe g(/). 
(3). fe S’—S. Let X¥(x) be the formula which defines f over (G, R, F, SD. 
Let rF’(fe g) if there is an he S such that PF~ (3x)~ (xe h=X(x)) 
and I'F’(he g). (This reduces the situation to case 1 or 2). 
We call the model <G, R, F’, S’> the derived model of <G, R, F, S>. 
Now, as above, let V be a classical model for ZF. We define a sequence of 
intuitionistic models as follows: 
Let <G, R, Fy, So» be any intuitionistic model satisfying the following five 
conditions: 
(1). <G, R, Fo, Sod € V; 
(2). So is a collection of functions such that if fe So, domain(/) S Sp 
and range (f) & B; 
(3). forf, ge So, TFo(fe g) ifand only if Fe g(/); 
(4). (extensionality) for f, g, he So, if [Tkyp>~(Ax)~(xef = xeg) and 
rey ~~(feA) then Pky ~~(geh); 
(5). (regularity) So is well-founded with respect to the relation x € do- 


main(y). 
Remark 1. If we consider the symbols v, a, ~, >, V,4,(,),6 1, X25 
x3,...to be suitably coded as sets, formulas are sequences of sets, and hence 


sets. It is in this sense that (1) is meant. 
Next, let <G, R, F,292 Sapam) be the derived) modelvor <G, Ri fps]: 
If A is a limit ordinal, let S, = (),<, S,. Let FF,(fe g) if for some a < A, 
PE,(fe g). Thus we have (G, R, F,, S,>. 
Finally, let S = J,ey S, and rF(feg) if for some «eV, Ft,(fe g). 
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Thus we have the ‘class’ model ¢<G, R, F, S>. 

At this point, let me remark without giving the proofs that the techniques 
which are used in handling the {M,} sequence have their analogues for the 
above sequences. As in the classical case we may show: 


THEOREM 2. All the axioms of Zermelo—Fraenkel (stated without universal 
quantifiers) are valid in any such <G, R, F, S>. 


Remark 2. As a special case, let Sp = 9 and let G have one element (then 
B has two elements, G and 9). If we define functions v,(X) = {If €G| 
Le,X} and we identify G with T and 9 with F, the resulting sequence <S,, v,> 
is identical with the sequence< M,, v,> above. Thus, as a special case of the 
above theorem, L is a (classical) ZF model. 

At this point it is possible to produce a particular (G, R, Fo, So» with so 
much symmetry built in that in the resulting class model (G, R, F, Sy ~ AC 
is valid. From this, as shown above, the classical independence of the 
axiom of choice follows. Since this model requires a fair amount of detail, 
rather than give it here I refer you to [15] and go on to show how ordinals 
may be represented in these models. 

By putting one more requirement on <G, R, Fy, So> it becomes possible 
to find quite satisfactory representatives of all the ordinals of V in the class 
models. Essentially, as in the classical case, each ‘ordinal’ will be the ‘set’ of 
all smaller ‘ordinals’. Let us make this more precise. 

We use a formula ordinal(x) with no universal quantifiers which classically 
defines the ordinals. 

Let us define ordinal representatives as follows: Suppose for each B < a 
we have already defined representatives in S, We call fe S a general represen- 
tative of « if 

(1) if ge S represents any ordinal <a, (gef) is valid in <G, R, F, SD. 

(2) if for some eG, IF(g ef), then for some R-successor A of P, some 

B <a and some he S representing B, AF(g = A), that is, AF ~ (3x) 
~(xeg =xeh). 
General representatives would be quite satisfactory to work with, if they 
existed, even if they were not unique. However, it is convenient to single 
out canonical representatives. 

If f is a general representative of «, we call f a canonical representative 
of « if 

(1) for no ge domain(/) and for no Pe G does 'F (f = g); 

(2) if FF ~~(gef), then FF (ge/f) for all g e domain(/). 
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For reasons to be given in a moment, canonical representatives, if they existed 
would be delightful to work with. To ensure their existence, we place two 
requirements on the 0-th models which say essentially that canonical re- 
presentatives in Sy (if any) are unique and that there are sufficiently many ca- 
nonical representatives in S, that any element of Sy which at some point is 
an ordinal may later be a canonical one. 


Formally, call <G, R, Fo, So> ordinalized if 

(1) no ordinal has more than one canonical representative in S,; 

(2) iffe S, and I Fy ordinal(f) for some I € G, then for some R-succes- 

sor A of F and some he Sy which is a canonical ordinal representative 
of some ordinal, 4 Fo(f= A). 
Now it is not difficult to show the following results: 

If <G, R, Fo, So> is ordinalized, 

(1) every ordinal of V is uniquely canonically representable by an ele- 

ment of S (denote the representative of « by @); 

(2) «= 6 iff (@ = f) is valid in <G, R,F, S, 

aep iff (@¢f) is valid in <G, R, F, S); 

(3) ordinal(@) is valid in ¢<G, R, F, SD; 

(4) ifforsome "eG, TI Fordinal(f), then for some R-successor A of I 

and some ordinal a, AF(f = @); 

(5) if the canonical representative @ is in S,4,;—S,, & is the function 

defined over the model <G, R, Fz, Ss> by the formula ordinal(x). 
Again we do not present proofs, but they can be found in [15]. Let me remark 
that making Oth models ordinalized is a natural requirement; models which 
are not are rather contrived things. 

There is an analogue to the classical notion of absoluteness: Call a for- 
mula X(x,,..., X,) dominant if for any f,,...,/, in 5, and any f eG, 
FER... +S,) iff PEL.£Gs,.+-J,). Formulas like (x ey), (x =y) and 
ordinal(x) are dominant, so whether a Oth model is ordinalized or not can 
be determined by considering it alone. 

Let cardinal(x) be a formula with no universal quantifiers, which clas- 
sically defines the cardinals. Then the following may be shown: 


THEOREM 3. Suppose <G,R,F,,S > is ordinalized and for some [eG 
and some ordinal x, IF cardinal(&). Then « is a cardinal in the model L of 
constructable sets. 


It is the opposite of this situation that is needed to show the independence 
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of the continuum hypothesis. A proof in Cohen may be adapted to these 
models to show the following: 

I’, A€G are called incompatible if they have no common R-successor. 
G is called countably incompatible if any subset of G of mutually incompa- 
tible elements is at most countable in V. 


THEOREM 4. If <G, R, Fo, So> is ordinalized, G is countably incompatible, 
and « is a cardinal of V, then cardinal(&) is valid in <G, R, F, S). 


Now a specific Oth model can be given which produces a class model in 
which ~(continuum hypothesis) and AC are valid. The model is essentially 
the same as the one in Cohen and the methods he uses can be adapted. 

We remark that constructible set representatives can also be defined and 
the proof of the independence of the axiom of constructibility of Cohen can 
be adapted to these models. 

I am afraid the foregoing has been only a collection of definitions and re- 
sults, without proofs. To give the proofs in detail would take pages. To give 
them in outline is to say they are the analogues of classical proofs or of proofs 
in Cohen. But now I am going to give even less details than before, only in- 
dicating the types of theorems that exist without stating them precisely. 

To the best of my knowledge there are three versions of the independence 
proofs (not counting those above): Cohen’s forcing technique, Vopénka’s to- 
pological method and Scott and Solovay’s Boolean-valued logic approach [9]. 

The connection between the above intuitionistic methods and those of 
Cohen should be clear to anyone familiar with Cohen’s work. 

Since there is a topological model theory for intuitionistic logic, there is of 
course a topological version of the above. In fact there is a direct translation 
between Kripke’s models and topological models, without going through 
the respective completeness theorems. I do not know how close translations 
of the above mentioned proofs would be to those of Vopénka. 

There are also pseudo-Boolean algebra models for intuitionistic logic [8], 
and again there is a direct translation between Kripke’s models and alge- 
braic ones. There are also some connections between pseudo-Boolean and 
Boolean algebras which apply in this case. Thus the above may be put into 
the language of Boolean-valued logics. The result is not the Scott and Solo- 
vay proof. We generalized the {M,} sequence, they generalized the {R,} 
sequence (sets with rank). Thus two more methods of showing independence 
become available, a Boolean-valued {M,} sequence and an intuitionistic (or 
forcing) {R,} sequence. Some details of both may be found in [15]. 
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AN ABSTRACT NOTION OF REALIZABILITY FOR WHICH 
INTUITIONISTIC PREDICATE CALCULUS IS COMPLETE 


H. LAUCHLI 


To each formula 4 of predicate logic and each assignment p of ‘proofs’ to 
the atomic parts of A we shall associate a set p[A], the set of ‘proofs of A’. 
The proofs of A > B are just the functions from p[A] into p[B]; the proofs 
of JvA are the pairs <c, x> such that x is a proof of A? (substitution of ¢ for 
v). 

Instead of ‘proofs of A’ we could as well say ‘realizing functionals for A’. 
In contrast to Kleene’s (second version of the) notion of realizability [3], 
we consider arbitrary, not necessarily countable functionals. 

We shall show that A is derivable in Heyting’s predicate calculus if and 
only if there is an explicitly definable functional © such that O € p[A] for 
all p, i.e. if and only if there is a well defined ‘proof’ of A which does not make 
use of the internal structure of proofs of the atomic parts of A. 

The ‘only if’ part will be clear from the known results about realizability. 
For the proof of the ‘if’ part we make use of the following analogy be- 
tween Kripke’s semantics for intuitionistic logic [4] and the theory of per- 
mutation groups. In the former, we can assert the implication 4 > Bina 
situation H iff in any later situation H’ where we can assert A, we also can 
assert B. In the latter, the following is true: Given sets A, B and a group H 
of permutations of the elements of A U B leaving A and B invariant. Then 
there is an H-invariant function from A into Biff any subgroup H’ witha fixed 
element in A also has a fixed element in B. 

The theorem will be established classically. The corresponding result for 
propositional logic was announced in an abstract [5]. 

Similar interpretations have been considered by Dana Scott (derived from 
Gédel’s Dialectica interpretation) and by Goodman, Kreisel, Troelstra, 
Scott (derived from the intuitionistic notion of ‘construction’; see [1]). To 
my knowledge, completeness of intuitionistic predicate calculus has not been 
established for any of these interpretations. 

ead 
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1. We consider formulas containing n-place predicate letters, a proposi- 
tional constant f (‘false’), individual constants, variables u, v, w,..., con- 
nectives A, v,—,4, V. We write A for 4 >f. F(I) denotes the set of 
all closed formulas with individual constants from a set I. 

In the following F and I are countably infinite sets, co e I is a designated 
element of [, Xx Y denotes the Cartesian product of the sets X and Y, 
X © Yis the disjoint union ({0} x X)U({1} x Y), ¥* is the set of all functions 
from X into Y. 

To each formula A, not necessarily closed, we associate a set S(A), the 
set of ‘possible proofs of A’: 

S(A) = I if A is atomic, 

S(AA B) = S(A)x S(B), 

S(Av B) = S(A)OS(B), 

S(A > B) = S(Bk™, 

Sipe) = Snr, 

S(avA) = I'x S(A). 

Note that S(4?) = S(A) for all individual constants c. Thus S(VvA) can be 
interpreted as the set of all choice functions which assign to each ce I an 
element of S(A®). 


A proof assignment is any function p which assigns to every (closed) for- 

mula Ae F(T) a set p[A] such that 

PL] S PIA] € 1 if A is atomic, 

plAa B] = p[A] xp[4], 

p[Av B] = p[A]up[B], 

p[A > B] = {xe S(A > B): xy € p[B] for all y € p[A]}, 

p[VWwA] = {x © S(WvA): xc € p[A?] for all ce T}, 

p[3vA] = {<c, x>: ceT and xe p[Ag]}. 
Note that p[A] < S(A) for all p and A. The elements of p[A > B] are func- 
tions with domain S(A). Thus the identity function on S(A) belongs to 
plA > A] for each p. 


2. Let & be the least class containing the sets {0, 1}, I, 7, such that when- 
ever D,,D,¢Q, then D,xD,, D,UD,, D,??¢Q. The elements of 
D,xD, are viewed as functions with domain {0,1}. Let ¥ = |) @. 
The elements of ¥F will be called functionals. Simple functionals are those 
which can be defined explicitly. The following kind of explicit definition 
will do: We consider terms built from constants 0, 1, cy and variables, using 
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the following formation rules: If ¢, s are terms and x is a variable and De J, 
then t(s), <t, s>, A?x(t) are terms. Terms are interpreted as follows: 

Let V be an assignment of functionals to variables. Then V[r] is the follow- 
ing functional: 


V[0] = 0; 
V(l] = 1; 
Vico] = ¢o3 


V[x] is the functional assigned to x; 
V[t(s)] is the value of V[t] at V[s]if V[z]is a function and V[s] belongs to 
its domain, V[z¢(s)] = 0 otherwise; 
V[X<t, s>] is the function with domain {0, 1} and values V[t] and V[s] 
at 0 and | respectively; 
V[A?x(t)] is the function with domain D, taking the value V*[t] for a € D; 
V~ assigns a to x and agrees with V otherwise. 
If t is closed (all variables bound by 4), then the functional V[t] does not 
depend on V. Simple functionals are by definition those given by closed terms. 


EXAMPLE |. Let A, B, C be closed formulas. Let 


D= S((A > C)A(B-> C)), 
E = S(Av B). 
Then the term 
Mx(A" y(<x0(y1), x1(y1)>(0))) 
defines a simple functional which belongs to 
pI(A > C)A(B> C) > (AVB> C)] 
for all proof assignments p. 
EXAMPLE 2. Let 
D = S(AV0(R(v) vm R(v))). 
Then 
Ha (e(darc( Cll dex(x)>))) 
defines a simple functional which belongs to 
S(>—V0(R(v) v > R(v))). 
3. Let o be a permutation on TU IT (i.e. a one-one function from Fu 


onto itself), which leaves invariant the sets I’, IT and the designated element 
Co. o extends in a natural way to a permutation on ¥: 60 = 0, ol = 1; if 
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g is a function then (og)x = o(g(a"‘x)). In particular o(a, b> = <oa, ob). 

A functional 0 e F is said to be invariant, if cO = © for all sucho. 
Simple functionals are invariant since cg, 0, 1 and all sets De J are 
invariant. On the other hand, there are uncountably many invariant 
functionals in ¥Y that are not simple. 


4. In the following A denotes a closed formula containing no individual 
constants other than cy. The symbol + denotes derivability in the intuitio- 
nistic predicate calculus. 


THEOREM. (1). Jf + A, then there is a simple functional © such that O € p[A] 
for all proof assignments p. 

(2). If not + A, then there is p such that p[A] contains no invariant func- 
tional. 


CoROLLARY. The following are equivalent (classically): 
(a). FA; 
(b) A simple O,Vp, © €p[A]; 
(c). Vpisimple@, © €p[A]; 
(d) J invariant O, Vp, O € p[A]; 
(ce). Vp J invariant O, O € p[A]. 


The corollary indicates a certain stability of the property ‘+A’. For condi- 
tions (b), (c), (d) and (e), which are quite different intensionally, turn out to 
coincide with ‘FA’ extensionally. 

If we drop the restrictions put on O, then we get classical logic in one case 
and an intermediate thing in the other: Vp i(arbitrary) O with O € p[A] iff 
A is derivable in classical predicate calculus. On the other hand, the condition 
3OVp holds for some intuitionistically invalid formulas, e.g. for 

Vv( R(v) v Q) > (WeR(v) v Q), 
but not for all classically valid ones, e.g. not for Ov — Q. 

Part (2) of the theorem is not true if only those p are considered with 
pf] = A (empty). The formula —— Vo(R(v)v R(v)), call it —=—B, 
gives a counterexample. For if p[f] = A then p[— B] = A since p[B] # A, 
B being classically valid. Hence p[—— B] = S(——B8). But S(——B) 
contains invariant functionals, as was shown in example 2. 

The proof of part (1) of the theorem is a routine variation on the proof of 
theorem 62, [2] p. 504. The remainder of this paper is devoted to the proof 
of part (2). 
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5. Let N be the set of the natural numbers (including 0). Let 2 be the set of 
all finite sequences of natural numbers (including the empty sequence 
A) together with an ‘ideal element’ U. Let R be the binary relation on Y such 
that sRs’ iff either s’ is U or s is a (not necessarily proper) initial segment 
of s’. Let ¥ be a function with domain Z and countable sets as values, such 
that whenever sRs’ and s ¥ s’, then Y(s) © ¥(s') and the complement 
Y(s’)— ¥(s) is infinite. We also assume Y(A) infinite. F(Y(s)) denotes the 
class of all closed formulas with individual constants from ¥(s). 

Everything introduced so far will remain fixed. In particular, we shall not 
vary the function ¥. 

A model is a binary function @(A4, s), where A ranges over F(Y(U)) and 
s over 2, whose range is the set {7, F}, and which satisfies the following 
conditions: 

(1). if (A, s) = T, then Ae F(¥(s)); 

(2). if (4, s) = T and sRs’, then &(A, s‘) = T; 

(3). if ®(f, 5) = T, then (A, s) = T for all A € F(¥(s)); 

(4). @(AAB, 5s) =T iff AABeF(¥(s)), and @(A,s)=T and 


C575) —si. 
(5). O(AvVB,s) =T iff AvReEF(¥(s)), and (A,s)=T or 
PCB, Sle 


(6). 6(4 > B)=T iff A> Be F(¥(s)) and for all s’ with sRs’, if 
CAs y— 7 then o(8, 5 ) =a: 

(7). @(WvA,s) = T iff (42, 5’) = T for alls’ with sRs’ and all c eY(s’) 

(S)mc(sods) = 7 git SiC AL ss) = 7 for some ce V(s). 


Lemma 1. Let A € F(¥(A)) with not + A. Then (and only then) there is a model 
® such that (A, A) = F. 


The proof is clear from Kripke’s work [4]. 
The element U is no bother: Any # which is defined on 2— {U} can be ex- 
tended to J by setting (A, U) = Tfor all A e F(¥(U)). 


6. In this section we establish a relationship between models @ and proof 
assignments p. 

Let g be a one-one function from J into the set of positive prime numbers. 
The function g from ¥ into N is defined by g(A) = 1, g(s#n) = 9(s)- 
- q(s#n), p(U) = 0 (s*n denotes adjunction of the last term n to the se- 
quence s). Let | denote the relation of divisibility. Each non-empty subset 
of 2 has a greatest lower bound (glb) with respect to R. For ne WN let 


232 H. LAUCHLI CXV 


5S, = glb{s: nlp(s)} (the set is non-empty since n|p(U)). For instance 
5 aes 

The following are simple consequences of the definitions: 

(1). e(s)lo(s’) iff sRs’, 

(2). nl@(s,), 

(3). nlg(s) iff s,Rs, 

(4). nj implies s,Rs 

(5). Sos)= 5) 

Let J denote the set of all integers, including the negative ones. Let n’ de- 
note the set of residue classes of J modulo n: 1’ is a one element set, 0’ is J. 
We have n’ nm’ = Aforn, me N,n ¥ m. 

We now define r = IT = |) {¥(s,)xn' :neN}. (More precisely: We 
impose a certain structure on the given countable sets F and IT. The struc- 
ture on I’ is isomorphic to that on JT. Thus, as a notational convenience, we 
identify [ with J7.) 

The designated element cy of I is to belong to W(s,)x1’. Let I, = 
L} {Y(s,) xk’: kn}. Thus Fr = I and cg eT, and 

(6). nlm implies [, < T,,. 

The elements of I are ordered pairs. If cel’, let c” denoteits first component: 

c é€W(s,) for some n. In virtue of (4), if k|n then ¥(s,) S Y(s,). Hence 
(7). cel, implies c” € ¥(s,). 

The converse is not true for all ce I’, but we have 

(8). ifde Y(s,) then there is cel, with c” = d. 

If Ae F(I), let A” denote the formula obtained by replacing each individual 
constant c by c . 

(9). Ae F(I,) implies AT € F(¥(s,)). 

Let o be the following permutation on I: If c = <d, i/n> € ¥(s,) xn’, then 
ac = <d, (i+1)/n>. Then, for all ce I and ne N we have 

(lO)Me'c=¢ if tce ae 
In particular acy — 7c, since cyeuiq. 

To each model ® we associate a proof assignment p as follows. Let 
Ae F(L) be atomic. We define 


PIA) = Aes) xe Ose sion Of et 


The requirement p[f] S p[A] < ZH for atomic 4 is satisfied. p[A] only de- 
pends on A . This carries over to compound formulas. Hence for all 
A, Be F(T) 

(11). A~ = B” implies p[A] = p[B]. 


my 
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Furthermore it is easy to see that 
(12). o(p[A]) = p[A], for all 4A in F(D). 


LemMa 2. For allne N and Ae F(I,,), 6” has fixed elements in p[A] if and 
only P(AG, s,) = 7 

(Thus we have a world constant o such that for all ® there is a p such that 
lemma 2 is true.) 

Proor. The proof is by induction on the complexity of A: 


1. Let A be atomic. By (10), o” has fixed elements in p[A] iff T, - p[A] 
# A, i.e. iff there isa k, k|n, suchthat ®(A,s,) = T or O(f, 5,) = T. By (4), the 
latter holds iff (A ~,s5,) = T or ®(f,s,) = T. Since Ae F(I,) (9) gives 
A e€F(¥(s,)). Thus ©(f,s,) =T implies 6(A_,s5,) = T. Hence o” has 
fixed elements in p[A] iff (4, s,) = T. 

2. Let A be BA Cor BV C. The induction step is clear. 

3. Let A be BC. ‘only if’: let x € p[A] with o"x = x. Consider s with 
s,Rs and @(B',s) = 7. Let m= g(s). Then njm by (3), and hence 
Be F(I,,) by (6). By (5) sis s,,. The induction hypothesis gives an element 
y €p[B] with o"y = y. Now xyep[C] and o(xy) = (o"x)(o"y) = xy 
and CeF(I,,). The induction hypothesis gives ®(C_, 5s) = 7. Hence 
@(A-,s,) = T. 

if’: Let ®(A_, s,) = T. If nlm then s,Rs,, (by (4)) and hence (B, s,,) = 
= T implies @(C ,s,,) = T. By the induction hypothesis for each m with 
n|m, if o” has fixed elements in p[B], then it also has fixed elements in p[C]. 
Let H be the permutation group generated by o”, and let H(y) denote the 
largest subgroup of H leaving y fixed. Since any subgroup of H is generated 
by o” for some m with n|m (at worst m = 0), we have that for each y € p[B] 
H(y) has fixed elements in p[C]. 

We say y, is equivalent to y,, ifhy, = y, for some he H. Let S be a maxi- 
mal set of pairwise non-equivalent elements of p[B]. Let g be a function 
from S into p[C] such that gy is fixed under H(y) for all ye S. Let x, = 
= {<hy, h(gy)>: ye S and he H}. x, represents a function: If hy = h'y, 
then h~'h' e H(y) S H(gy), whence A(gy) = h'(gy); dom(x,) = p[B] be- 
cause of the maximality of S and the invariance of p[B] (see (12)). Also 
re(x,) S p[C]. Thus x, is an H-invariant function from p[B] into p[C]. 

Recall that the elements of p[B > C] are functions with domain S(B), 
the set of ‘possible proofs of B’. Let x, be any invariant function from 
S(B)—p[B] into S(C), for instance the constant function with an invariant 
element of S(C) as value (which is easily seen to exist, since both F and IT 
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contain the g-invariant element cg). Then x = x,Ux, belongs to p[B > C] 
and is invariant under o”. 

4. Let A be VoB. ‘only if’: Let x e p[VWvB], ox = x. Let s,Rs, de Y(s). 
As before, we have n|m and s = s,, form = ¢(s). By (8) there isc eT, such 
that c_ = d. By (10) oc = c. Therefore o”(xc) = xc. Also xc € p[B?] and 
Boe F(I,,). Induction hypothesis gives ((B2) ,s)=T. (BZ) is (B)j. 
Hence @(VoB’, s,) = T. 

uf; Let ©(VvB .7s,)— Po wletrcees lf isthe group generatcdeps 
o", then H(c) is generated by o” for some m with n|m. By (10) ceT,,. By 
(7) c” € Y(s,,). By (4) s,Rs,,. Therefore ©((B2) , s,,) = T. Also BZ € F(I,,). 
By induction hypothesis H(c) has fixed elements in p[B?]. As before we 
get an invariant function. There is no trouble with the range, since for all 
h eH, h(p[B2)) = p[B2] = p[B;.] in virtue of (12) and (11). Therefore o” has 
fixed elements in p[VvB]. 

5. Let A be 4vB. The proof is straightforward, using (7), (8) and (10). 

This concludes the proof of the lemma. 


Proof of part (2) of the theorem: 

Let A be a closed formula containing no individual constants other than 
Cg. Then A € F(I',) and A” e F(¥(s,)) = F(Y(A)). Assume not +A. Lemma 
1 gives a model @ such that ®(A, A) = F. Let p be the proof assignment 
associated to ®. Then by lemma 2, o has no fixed element in p[A ]. Therefore 
p[A] contains no invariant functional. 
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EXTENDING THE TOPOLOGICAL INTERPRETATION TO 
INTUITIONISTIC ANALYSIS, I 


DANA SCOTT* 


This paper is a sequel to the paper [12] written for Professor Heyting 
under the same title. Nearly all of the questions left open in [12] have been 
answered. In particular the results of section 3 in [12] having to do with 
universal formulae of the theory of < in three variables have been extended 
to arbitrary universal formulae in section 5. (Our numbering of sections con- 
tinues that of [12].) We then discuss in section 6 the general metama- 
thematical implications of the method of section 5 for the theory of the 
topological model of intuitionistic analysis. In section 7 the important step 
is taken of enlarging the model to encompass arbitrary (extensional) real 
functions. The main result is the verification in the model of Brouwer’s 
theorem on continuity: all functions are uniformly continuous on closed in- 
tervals. The proof is given in detail along with several related results. (The 
reader will have to refer to [12] for notation and the definition of the model.) 

The author was thus able to conclude this paper feeling that he had a 
rather good grasp of the basic properties of the real numbers of the model. 
Several further projects remain to be carried out, however. The next im- 
portant step is to discuss the corresponding topological interpretation of 
second-order arithmetic and the theory of free-choice sequences of integers. 
This will make possible an exact comparison of the theory of the model and 
the usual axiomatic theories of intuitionistic analysis (which will no doubt 
be one of the main topics of part III of this series of papers.) Following such 
work it is obvious that attention must be given to obtaining a constructive 
version of the model. Kreisel has suggested that the theory of constructive 
and lawless sequences (the system of [8]) may provide the proper framework 


* Research on this paper has been supported by a grant from the National Science 
Foundation. Special thanks are due to Professor G. Kreisel for the many hours of dis- 
cussion on intuitionism in general and the helpful criticisms of earlier drafts of this paper in 
particular. 
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for this discussion in view of the connection between the topological inter- 
pretation and lawless sequences [5] (part IV?) In this connection it is appro- 
priate to take this opportunity to retract a remark of [12], p. 194. Contrary 
to what was said, Kreisel has all along felt that the topological interpre- 
tation has relevance. Indeed in [6] he mentions it along with realizability and 
his own elimination of free choice sequences. Moreover in [5], p. 370, para- 
graph 2, he had already presented the clearest possible statement of his view 
of the interpretation. 

In another direction the theory of species and higher-order functions will 
require consideration. That study will of course bring in connections with 
the Boolean models for set theory ([11] and [13].) To date no particular 
technical results about the Boolean models have played a role in the dis- 
cussion of these topological models. In fact, the main effort was given to 
the consideration of those properties of the models and topological spaces 
most appropriate to foundations of intuitionism. However, it was the success 
of the Boolean models in making clear certain aspects of Cohen’s remarkable 
independence results (such as the type of construction of generic and ran- 
dom reals) that encouraged the author to take up again the study of the topo- 
logical interpretation which he had put down some ten years before. And he 
is most happy to see now just how well things fit together. It does not seem 
altogether impossible that work in the intuitionistic theories may throw 
additional light on the properties of the Boolean models (cf. [2] and other 
references given there). 


5. Further independence results. In section 3 we discussed universally 
quantified three-variable consequences of the axioms of order (1.1}-(1.2) 
and noted that such questions could be thrown back to propositional 
calculus as well as being determined by the model. At the time of 
writing the earlier section the results were rather special, and the author 
suggested that they should generalize. They do. Kreisel has pointed out 
the relevance of the following well-known fact about HPC (Heyting’s 
predicate calculus): 

A universal sentence is a consequence in HPC of a given universal axiom 
if and only if its matrix is a propositional consequence of a finite number of 
substitution instances of the axiom using the variables mentioned in the con- 
clusion. 

Note that our language has only relation symbols, in fact only <, which 
makes the result so simple; in particular, we have a decision method for 
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universal consequences of a universal axiom. The proof of this general 
result requires no special knowledge of models. 

Let us consider the theory of <. We can schematically indicate the gen- 
eral universal sentence as 


(oan. ... Ul Caices Rae D 


where we mean to suggest that the expression involves the various atomic 
formulae x; < x,. Let P;; be propositional letters and let 


ACE lg <1) 


indicate the result of replacing x, < x, by P,; in the matrix of our universal 
Sentence, Further let 


B(P,j: i,j <n) 
be the conjunction of all formulae of the two forms 


and 
[Pi; > Pau V Prj] 


for i,j,k <n. In our case the general result mentioned above shows that 
the given universal sentence is a consequence of (1.1}-(1.2) if and only if 


(SCP een Al. i, ) < 7) | 


is a propositional theorem in HPC; hence the decision method. This approach 
is definitely only suited to universal prenex formulae, however. The author 
is rather inclined to believe that the full first-order intuitionistic theory based 
on (1.1}+(1.2) is undecidable. 

Having noted the above purely formal result, we may now ask: Are the 
axioms (1.1}-(1.2) complete for the universal sentences valid in our model? 
We shall prove that, under suitable conditions on the topological space T, 
if the universal sentence is not provable in the theory of <, then it fails in 
the model. Indeed we shall find that the contradictory sentence 


ea Ay < Xt <= Hn) 


is actually valid in the model. For this purpose in view of [10], pp. 130-131 
we find it convenient to assume that T is a non-empty, dense-in-itself, to- 
tally disconnected, metric space; the two best known examples being the 
Cantor space 2” and the Baire space NV’. We prefer the latter because of 
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its naturalness in interpreting second-order number theory. Thus for the re- 
mainder of the paper we may as well fix 7 = N%. In the future there may 
be reason for greater generality, but this choice of T gives a complicated 
enough model with sufficiently many interesting properties. 

Suppose, then, that the universal sentence is not provable from (1.1)— (1.2). 
Thus the corresponding propositional formula is not provable either. By 
what we know about T = N™, we can assign open sets [P;;] to the P;,; in such 
a way that 


NBEO ee c8 0 ce, (a)| a 18% 
while 
LA(P;;: i,j < n)] # T. 


We wish to find continuous functions €;¢ & such that 


[é; < ¢,] = LP], 
and this will give us the counterexample in the model. 

It seems convenient to construct certain auxiliary functions before ob- 
taining the €,. In the first place, given open sets [P;,] and given a subset 
Ic {0,1,...,}, we can define continuous functions 6, € & that are non- 
negative and such that 


[or > 0} = () () [Pal- 


kel I¢l 


Next, for given i, 7 < n, we define z,,€ & such that 


where the summations run over all the subsets satisfying the indicated re- 
strictions. Finally we set for i < n 


Ci = Tors 


but it takes several steps to see that these functions so defined have the desired 
properties. (These formulae, by the way, do indeed generalize those of 
section 3.) 

We note first that, fori, j <n, 


and 
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hold by definition. The z,;; are by no means non-negative, but we shall see 
that 


[z;; > 0] = [Pi]. 
Assume that z,,(t) > 0, then by the definition of the z,,; and the non-nega- 
tive character of the o,, we find o,(t) > 0 for at least one subset J such that 
ie/,j¢J. But then by the construction of o,, we have re|P,,]. Now 
for the converse, assume that te[P;,;]. We must obtain a particular 
Ic {0,1,...,2—1} such that ie/, jé/, and 


vey () LPul: 


kel 1¢I 


where of course k, / range over all indices less than n satisfying the restric- 
tions. To construct J we proceed by induction. Let J) = {i} and Jy = {j}. 
Since 


[Py] 0 [Pj] = 9 
we know that i # j, because te [P,,]] but ¢ [P,;]. Hence I) 0 Ig = 9 and 


te) {)\ [Pull 


keIg leIo’ 


Suppose now g <n, and J, I, = 9, and 


ery 10 (er: 


keIg lel,’ 


Let m be the least integer where m <n, m¢I, O1,. We will show how to 
adjoin m to I, or to I; preserving the above properties. Because, from the 
assumptions about the P;,;, we have 


[Paul S [Pum] V [Pol 

so that 

QO LPal S CA) Pend UO) LPmul). 

keIg lelq’ kelg leg’ 
Now if t€ (ser, [Pim], set Iga, =J, and Ij,, =I, U{m}. While if 
te (\ter,: LPmil, set [41 = 1, {m} and I), = Ij. The required proper- 
ties are obvious in either case. Proceeding in this way to qg = n (or is itn—2 
since we started with i, j as given?) we obtain J and its complement as desired. 
Having found this J we see o;(t) > 0. Now if J is any subset with j € J and 
i¢J, then since t¢[P,;], we have o,(t) = 0. We have thus proved that 
n,,(t) > 0. 
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To complete the proof we must prove in addition 
Tye Ry t Nyy = (0). 


Note that in view of our above remarks about the 7,,, it is sufficient to assume 
the i, j, k < n are all distinct. We must, unfortunately, write out the sum on 
the left-hand side of the above equation, namely: 

Pig ews iar )) op a inser 

iel jel jel ket kel ieJ 

jel i¢y kgl i¢d i¢l kes 
There are two kind of terms: positive and negative, and there are equal 
numbers of each. Thus if we can show that each positive term can be can- 
celled by a unique negative term that is formally equal to it, then the whole 
expression cancels to zero. In view of the symmetry of the situation, consider 
a typical term o, with i € Jand/ ¢ J. By inspection, of the six kinds of restric- 
tions on the capital letter subscripts in the expression only two could con- 
tain an occurrence of the same subscript J. And actually only one can arrise: 
the cases k € J and k ¢/ are disjoint. Thus indeed a positive occurrence of 
o, cancels with a unique negative occurrence with the same subscript. Our 
argument is complete. 

From the equation just proved we see that 


epee = hy 
[e: < ¢;] = [Py]. 


WE Sgr, PSMA) Me 


Since the only interesting situation is where n > 2, we can derive exactly as 
in section 3 the validity in the model of 


which implies that 


It follows that 


VX Gla a2, ne ee i et) (5.1) 


which gives us a large number of independence results. What we would like 
to discuss next is whether there are reasonable axioms for intuitionistic 
analysis from which these formulae (5.1) become provable. 


6. Maximality of propositional calculus. In classical logic we have a very 
easy time of it: an unprovable propositional schema leads at once to contra- 
diction after a suitable substitution of quite trivial formulae for the pro- 
positional letters. Thus we know in a most direct way that in our formaliza- 
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tion no valid principles have been overlooked. The predicate calculus is 
not that simple, otherwise it would be decidable. Nevertheless, if we con- 
sider a theory such as that of all (classically) true sentences of first-order 
arithmetic, we conclude from Gédel’s theorem that each unprovable pre- 
dicate calculus schema has an instance in the language of arithmetic that is 
indeed false. Let us formulate this idea as a general metamathematical no- 
tion. 

On the one hand we have a Jogical calculus (classical or intuitionistic 
propositional or predicate calculus, to be definite) which consists of various 
schemata and some rules (maybe, only modus ponens and generalization.) 
On the other hand we have a theory which consists of a set (species) of valid 
formulae in a certain (applied) language. We assume that we know what it 
means to form an instance of a logical schema in the applied language. Ob- 
viously we want to assume that the logic is sound for the theory; that is, all 
instances of schemata are valid and all instances of rules lead from valid for- 
mulae to valid formulae. What is more interesting is the question of whether 
the logic is maximal for the theory. This means that if we take a schema not 
provable in the logical calculus, then the adjunction of all instances of this 
new schema render the given theory inconsistent in the sense of the original 
logical calculus. (We can take the inconsistency to be the resultant provability 
of arbitrary formulae.) We use the word ‘maximal’ here as being more des- 
criptive than the overworked ‘complete’. As pointed out, classical proposi- 
tional calculus is maximal for any (classical) theory. Classical predicate 
calculus is maximal for certain theories, though not for any recursively enu- 
merable theory. What about the intuitionistic logical calculi? 

An analysis of the argument of section 5 shows that atomic formulae of 
the form y > 0 have instances which, in the model, take on any value (any 
open set). Thus with our choice of T = N% we can easily argue that if 
C(P;: i < m) is a propositional formula not provable in HPC, then 


Vor aer CU, S0sn =< m) (6.1) 


is valid in the model. Note that the negation falls outside the scope of the uni- 
versal quantifiers. Thus no instance of the unprovable C(P;: i < m) is likely 
to lead to a propositional contradiction (unless it is classically invalid.) 
However, an obvious instance combined with universal generalization does 
lead to a contradiction. Therefore we can say that the propositional part of 
HPC is maximal for the theory of valid sentences of the model in the context 
of all the rules of HPC. Whether the full HPC is maximal for this theory the 
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author cannot see at the moment; though this may in fact follow from the 
known topological completeness proofs for HPC. 

A question that should be answered concerns those axiomatic theories 
of analysis for which we have the same HPC maximality. Kreisel has veri- 
fied in [5], p. 378, propositional maximality for a theory of lawless sequences. 
The theories of Kleene—Vesley [4] and Kreisel—Troelstra [14] need investi- 
gation on this point. The many fragmentary results of [4] point in this di- 
rection; and one should consider in this connection whether the full VaIf- 
continuity is really needed or just Vad!f. Another question: could there 
be any intuitionistically acceptable extension of Heyting’s arithmetic (HA) 
for which we would have propositional maximality? It seems unlikely. 

From the axiomatic point of view, propositional maximality has certain 
useful consequences. We shall now show that (6.1) together with some very 
elementary algebra allow us to formally derive the results of section 5. Hence, 
if in [4] it would have been possible to obtain (6.1), many of the results of 
[4], ch. IV would have followed directly. 

Let us recall the notation from the previous section: B(P,;: i,j <n) 
stood for a certain conjunction related to the axioms of order; A(P;;: i,j < 7) 
was an arbitrary formula such that 


[B(P;;: i,j <n) > A(Pij: i,j < n)] 
was unprovable in HPC. Let p,; be distinct real variables for i,j < n. Our 
construction of the €; in section 5 could be turned into a formal proof of 
B(py > 0: i,j <n) 3x0,--.,X%n-1 A [i < x; Diy > 0), (6.2) 
i,j<n 
where /\ is the sign of conjunction of several formulae. The proof would 
be rather long: Assuming the left-hand side, we would have to first introduce 


Py = max(p;; 0), 


to get non-negative variables, and then invoke the elementary theorems to 
have 
de 
Piz > OF pj; > 0. 


But that is easy. Next we would have to introduce for / ¢ {0,1,...,n—1} 


& = [1 Dl ets 


kel W417 


where | | is the sign of arithmetic product. Elementary algebra tells us that a 
product of non-negative terms is strictly positive if and only if a// the terms 
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are strictly positive. Thus the formal s, correspond to our use of o,; in the 
informal proof of section 5. Then we could introduce p,; corresponding to 
the z;; and then the x; corresponding to the ¢;. Again by elementary al- 
gebra we could transcribe our proof to finally establish (6.2). 

Now we invoke (6.1) and the unprovability of the implication combined 
with (6.2) to prove 


SV Xoses Mean Ae, X21 <n), (6.3) 


which is not quite as strong as what we validated in section 5. But note that 
[x; < x; x;+y < x;+y] by elementary algebra. This translation in- 
variance makes it easy to prove, as in section 3, the following 


eee A es Voile Ti lear VY aren Kees A(X X;° ffeil), 
(6.4) 


where on the right-hand side we have left x, free. Obviously the implication 
goes from left to right; for the converse one just translates the fixed x9 to 
an arbitrary x). The reader can surely do this for himself. That is how we 
get the formal proof of (5.1). 

The deduction just outlined illustrates a familiar phenomenon: a result 
purely in the theory of one predicate (in this case it was <) was obtained 
only with the aid of several auxiliary notions (viz. +, -, —, max, min) to- 
gether with their elementary properties. In classical theories we have exam- 
ples in various fragments of integer arithmetic but not in the theory of order 
in the continuum. This is another indication that the intuitionistic theory of 
< is more complicated than the classical theory. 

To further illustrate the usefulness of the auxiliary notions for under- 
standing the basic theory of < we may mention the definitions of z e€ [x, y] 
and z € (x, y) given in (4.2)}-(4.3). These are equivalent to the definitions in 
[3] (3.3.2.1, p. 40), and we find in [3], p. 41 a proof that 


z é [x, y]  min(x, y) < z < max (x, y). (6.5) 


That obviously makes very good sense and clears up the doubts the author 
had when he wrote section 4. However, in the case of the open interval the 
best we can do is 


z€ (x, y)<+ —— [min (x, y) < z < max (x, y)]. (6.6) 


The author is only very slowly beginning to understand the role of —— in 
intuitionistic mathematics, and hopes to discuss it again at another time. 
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A different kind of consequence of the method used to verify (6.1) shows 
us that for arbitrary formulae A 


Iy[y> 00 A] (6.7) 


is valid in the model. This is a strong version of Kripke’s schema first men- 
tioned in print by Myhill [7], p. 174. Myhill’s weaker statement of the prin- 
ciple would read in our notation: 


Jy[y < 0- A Alaly> 0-4). 


Clearly this statement follows from (6.7) but one would guess not conver- 
sely. Assuming that the system of analysis supplied by our model is at all 
reasonable intuitionistically, we thus obtain a consistency proof for this 
form of Kripke’s schema. As Myhill has already pointed out, (6.7) is incom- 
patible with Vaif-continuity ([7], pp. 173-174). Thus to justify the reason- 
ableness of our model, we must now turn to a discussion of what continuity 
properties actually are valid since such principles are desirable intuitionisti- 
cally. 


7. Functions and continuity. We are now going to enrich our language of 
analysis by the adjunction of function variables. We reserve f, g, h (with or 
without subscripts) for such variables and employ the usual function-value 
notation f(x) in the formal language. We are assuming our functions to be 
everywhere defined, real-valued functions of one argument. The extension of 
the language and the results to functions of several arguments seems to 
present no difficulties and is left to the reader. 

The interpretation of these functions in the model will be given in a very 
Straightforward way and will lead to several obviously basic principles 
being valid. In the first place our functions will be extensional: 


Vf¥x, yx = y > f(x) =f); (7.1) 


and apartness and equality of functions will be given by the following defi- 
nitions: 
VA OLS # g Ax f(x) # 9(x)], (7.2) 


Vi gLf =9- af #9]. (7.3) 
Next we will have the principle of function existence: 


Vxd!ly A(x, y) > 3Yfvx A(x, f(x), (7.4) 
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where A(x, y) is an arbitrary formula which is assumed to be extensional (as 
are all of our formulae for the time being) in the sense that 


Veer yy IAG Wax =~ Ap—y = 40.7) 


must be valid. 

There are no surprises here, since (7.1)-(7.4) are clearly rather fundamen- 
tal for any reasonable theory of functions. What is surprising is that, without 
seeming to have built the fact into our interpretation, we do validate the 
principle of continuity: 


VfV¥z, we > 04d > OVx, ye[z, w][[x—yl < d > |f(x)-f(y)| < e],(7.5) 


where we have used e and das rational variables in place of the conventional 
é and 6. Note that (7.5) is not just point-wise continuity of every function 
but uniform continuity on every closed interval. 

A simple consequence of uniform continuity is: every rational-valued 
function is constant. A moment’s reflection shows it is enough to prove that 
every integer-valued function is constant; and, even simpler, it is sufficient 
to prove that every two-valued function is constant. This last principle has a 
simple schematic statement: 


Vx[A(x) v 5 A(x)] > [VxA(x) v Vx — A(x)], (7.6) 


a principle which states, in Brouwer’s terminology, that the continuum is 
‘unzerlegbar’ (cf. [4], *R10.4, p. 155). The proof of (7.6) begins with the 
introduction of a function f such that 


Wx[[A(x) Af (x) = O] v[— A(x) af (x) = 1], 


which is justified by (7.4). Then one applies (7.5) with e = 1 for an ar- 
bitrary pair of numbers z, w. The interval [z, w] can easily be covered by 
intervals of length d, where d is supplied by (7.5), and the inequalities 
together with an inductive argument imply that fis constant on [z, w]. 
It is tempting on rather formal grounds to assume a version of the axiom 
of choice: 
VxdyA(x, y) > 3fvx A(x, f(x)), (AC-RR) 


where we have followed Kreisel [8], pp. 233f, in style of terminology with RR 
meaning that the choices are from reals to reals. Without thought, (AC-RR) 
would seem to be a reasonable strengthening of (7.4); however, in view of 
the remarks in connection with (7.6) we see that it is actually invalid for the 
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extensional notion of function intended here. Indeed the formula [x < ya 
y €Q] gives the obvious counter-example. 

It is quite pleasant that to be able to achieve all the above results the in- 
terpretation in the model can be as simple as possible: we take as functions 
those mappings 9: # > & such that for €,47 eB we have 


[é = n] s [e() = o@)]. 


This is the extensionality principle: we make (7.1) valid by definition. We 
let @* denote the class of all these extensional functions 9: # > &. 
Further we define for 9, ye 2? 


[9 4 ¥] = U lol) 4 ve). 


This convention makes (7.2) valid also by definition; similarly for (7.3). But 
this is all that is trivial by definition; the remaining facts require proof. 

To prove the validity of (7.4) it is clearly enough to prove existence of f, 
because the uniqueness will follow in view of the hypothesis and (7.2-7.3). 
Thus we must show that 


[Vxa!yA(x, y)] S [SfV¥x A(x, f(x))]. 


We will take advantage here of the fact that T = N% is totally disconnected 
(though no doubt all we really need to know is that T is metric.) In such a 
space every open set is the union of its clopen subsets. Hence it is sufficient 
to prove for every clopen set K ¢ T that if 


K & [Wx3!y A(x, y)], 
then 
Ke apy A(x) 


Indeed, to prove this last inclusion, we need only find a function 9, ¢ #* 
corresponding to K such that 


K & [Vx A(x, @x(2))]- 


The advantage of working with clopen K is that for te K and €€ # we 
will have a special way of defining 9x(€)(t), but for t¢ K we can simply set 
(x(€)(t) = 0. The clopenness of K will assure that ox()€ & so defined is 
continuous. 

Thus let €€ & be fixed. Our assumption implies that 


Kc [3lyA(é y)], 
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To define o,(&) we use the same method as in section 2 and set 


Ox(S)(t) = inf {reQ: te [ayLA(e, y)ay < r]]} 


for té K; otherwise the value is 0, as mentioned above. As in the proof 
of (2.7) it is elementary to check that 


Kn 9x(é) ‘(9,4') = Ka [Ay[A(E y)ag < y < 4'T], 


which shows that o,(€) is continuous on K. Therefore g,(&) is continuous 
everywhere in T. This does not yet prove that o, e &. We must still verify 


Ko [€ =n] S [ec = ox(n)]; 


but this easily follows from the extensionality of A. Since gx(€) is 0 outside 
of K, we see that indeed o, € &*. Finally it is clear by construction that 


i e= [ACE ox(€))], 


and the desired conclusion follows. 
The proof that (7.5) is valid is by no means as straightforward. Note 
first that (7.5) implies 


VfVx, YL (x) # f(y) > x # I, ea) 


which is a stronger version of (7.1). As it turned out, it is very useful to 
validate (7.7) first as a lemma, and then establish (7.5). (This point was a 
stumbling block for the author, who formulated the problem in a lecture at 
the University of Texas at Austin. Shortly thereafter the idea of the following 
proof was communicated to the author by Professor Jack Hardy, whose 
assistance it is a pleasure to acknowledge.) 

To prove (7.6) we note first that what we have to establish is this inclusion 
for €,n€ and pe": 


[o(S) 4 o(n)] s LE # a]. 


Now after translation back to subsets of T, we can complement both sides 
of the inclusion to obtain: 


{te T: O(t) = n(t)} S {te T: e(€)(t) = o(m)(d}- (*) 


That is what we must prove; what we are assuming (because g € &*) is 
equivalent to: 


In {te T: &(t) = n()} S {te T: PA) = oln(O}- (+8) 
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We can state the problem in words: (#) means that @ is an operator on con- 
tinuous functions (to continuous functions) such that if two arguments 
é, 4 are equal at a point, then their images g(€), p(n) are equal at the same 
point. Thus (&)(t) only depends on ¢ and &(t), not on the whole of €. It 
would seem that (##*) is much weaker: o(&)(t) depends on knowing ¢ and 
knowing € in some neighborhood of t; that is, on the local behaviorof € around 
t. (The author imagined at first that one could have a function @ satisfying 
(#*) without its satisfying (*). It seems odd that a problem like this has not 
been considered before, but inquiry failed to uncover any relevant infor- 
mation. It would fair to say that most of the operators considered in mathe- 
matics usually depend on the whole of & (like integration) or do zot give 
continuous functions (é) as values (like right-hand derivatives).) 

The first main step in deducing (*) (for all €,n€ @) from (**) (for all 
é,n € @) is to remark that the set on the right-hand side of (**) is closed 
because p(€) and ¢(7) are each continuous. This means that we can strengh- 
ten (##*) to read: 


ClIn {te T: €(t) = n()} S {te T: (2) = o(n\(D}, (+#*) 


which we are assuming for all €, n € 2. Now we must prove (*) for a parti- _ 
cular pair € 9,4) € &. Suppose ty € T is a point such that €9(to) = no(to). 
We will show below that we can find open subsets U, V of T such that 


CCU ia Cli — as 
and such that & is bounded on U and np is bounded on V. Now the function 
Co = of C1(U) U m0 [C1(V) 


is obviously continuous on Cl(U) U Cl(V) because &9(to) = no(to), and it 
is bounded. Thus by the well-known Tietze extension theorem (T is a metric 
space) we can extend the function to the whole of T. We call the extension 
Cy also. (This construction of C9, which is due to Prof. Hardy, is the main idea 
of the proof.) Now we make use of (###*) for €, C5 and for €5, yo. In 
the first instance we see by the construction of £, that: 


f © Cl in (re fc,(t) — Ca), 
and so by (###*), @(€)(to) = 9(fo)(to). In the second instance we have 


toe¢ Clin {te T: C(t) = no(t)}, 
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and so again by (*##*), @(C5)(to) = @(No)(to). Now it follows at once that 
(Eo)(to) = 9(No)(to), Which is what we needed to establish (*). 

Filling in the part of the argument about U and V that we had postponed, 
we note first that if fo is isolated we can take U = V = {t,}. Assume then 
that fy is not isolated and let t,, 4, f3,..., ¢,,... be an infinite sequence of 
distinct points converging to fg. Inasmuch as € and #9 are continuous, they 
both must be bounded on {t., t,,...,t,,--..}. Around each of these points 
we can put small open spheres with ¢, € S,, such that the closures of the spheres 
are pairwise disjoint (J is a metric space!) and on which &) and yp are uni- 
formly bounded. It follows that 


U a U Ry 
n=1 
and 
V= U San=t 


are the required open sets. 

We shall now investigate the nature of the functions g ¢ #* more closely. 
The reader will appreciate that it is necessary to keep his wits about him in 
thinking about the model. We already use functions € : T + R to play the 
role of real numbers in the model, so that when we come to functions in the 
model we have to use operators 9: 2 > &. Thus ¢(€) is a function for 
each €e #, as we had to consider in the above proof of (7.7). Looking 
from the outside, the type of the objects used in the model is always higher 
than the formal type of the variable which ranges over them. This process 
of going up in type for the definition of the model is very convenient for 
making the construction simple, but we shall now show that a reduction in 
type is possible helping in the analysis of the model. 

Since g € #* satisfies (+) we define 6 : Tx R > Rby the condition: 


P(t, a) = b iff for some Ee Z, €(t) = a and o(€)(t) = B. 
We thus have @ well-dertermined and such that for é¢ Z, teT: 
o(SKt) = P(t, &(2)). 


The point of bringing in @ is that, as we shall now show, the function @ is 
continuous on the product space T x R. We argue bycontradiction: suppose 
that @ is not continuous at some point (¢, a). Then there must exist ane > 0 
and a sequence of neighborhoods U, of t and a sequence of reals 6,, such that 
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the U,, monotonically decrease to {r} and the 6, to 0. Further we can find 
a sequence of points (¢,, a,,) where rt, € U,, and |a,—a| < 6,, and 


|®(t,, @,)— P(t, a)| > € 


for all n. Since for fixed a,, a the two expressions (1, a,) and (t, a) repre- 
set continuous functions of t, we can clearly assume that all ¢, 4 t. (Other- 
wise we could ‘adjust’ z, € U,, and still preserve the above strict inequality). 
Furthermore, because the diameters of the U, are converging to 0, we can 
assume that the ¢, are all distinct. (If not, we can choose a suitable subse- 
quence.) After that fuss, we can now remark that we are able to construct 
a continuous function €€ # such that €(t,) = a, and €(t) = a, because 
the a, do converge to a. But then since ~(€) is continuous, (€)(t,,) would 
have to converge to g(€)(t), which contradicts the above inequality. 

Now that we see why © is continuous, we can quickly establish the 
validity of (7.5) for . Note first that it is sufficient to consider z = q and 
w =r, where g,reéQandq <r. Lete > 0,¢€ Q. We must show: 


iE ee ee {re T: [E(1), nO e[¢, r] aléQ—-n@| < 6 > 
| P(e, ())— (4 n(0))| < eh. 


Introduce an auxiliary function e:Tx (0, 00) > R by: 
e(t, 6) = sup {|®(z, a)— (1, b)|: a, be[g, r], Ja—b] < 5}. 


The function inside the sup is a continuous function of three variables ¢, a, b, 
and the sup is being taken over a compact subset of Rx R. It follows for a 
fixed 5 that e(t, 5) is a continuous (and well-defined!) function of t. Since 
for fixed ¢ the real function ®(¢, a) is uniformily continuous for aeé [q, 7], 
we conclude that e(t, 5) decreases to 0 as 6 decreases to 0. Let, then, tp ¢ Tand 
choose 6 so that e(t9, 6) < ge. Let U S T be a neighborhood of ty such that 
e(t, 6) < ¢ for all te U. We must verify that U is included in the right-hand 
side of the above topological equation. It is immediate. 

The method of proof just employed for (7.5) is also very helpful in other 
problems: for example we shall now validate in the model a principle which 
states that every positive function on a closed interval is bounded away from 
zero: 


VfVz, wlVxe[z, w][f(x) > 0] > ay > OVxe[z, wI[f(x) = yl]. (7.8) 


In view of continuity, and the denseness of the rationals, it is sufficient to 
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treat only the case where z = g, w =r, q <r. For simplicity we shall in 
addition assume for our g e & that 


T = [Vx e[q, r][e(x) > OJ]. 


The more general case can be treated as in the proof of (7.4). In terms of & 
we are thus assuming that 


P(t,a) > 0 


for all te T and all ae [q, r]. Define yn € & by n(t) = inf{ (7, a) : ae [q, r}} 
for t¢ 7. Again by virtue of the compactness of [g,r], we can check that 
7 is continuous. But for fixed ¢ we have ®(t,a) as a positive continuous func- 
tion of ae [q,r], therefore y(t) > 0 for all te T. We can thus easily verify 
that 


T = [Vxe[q, r}[g(x) > nJ], 


and (7.8) is established. We may remark that this construction of 7 shows 
also that every function has a greatest lower bound on every closed interval 
—a fact that can fairly easily be deduced from continuity (cf [1], p. 35 for 
example.) 

Our reduction of the 9: # > & to the : TxR = R is very useful in 
discussing other questions about the model. For example, every ordinary 
continuous function F: R — Ran be extended to &, as we have often re- 
marked, by the formula 


p(E)(t) = F(E@)). 


The question is whether every g € &? is ordinary in this sense. Well, obvious- 
ly not: let « e & be fixed and define 


O(S)(t) = C(t) +2(7). 
Could there be an F: R > R such that 
F(C(t)) = ¢(¢)+a(7) 
holds in general? If so, we could take € to be constant, say 0, and deduce 
F(0) = (2), 


thus making « constant. Since « need not be constant, the desired counter- 
example is produced. 
The foregoing simple argument leads naturally to a modification of our 
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question: given 9 € & could we find a continuous function G: Rx R-> R 
and a particular «¢ & such that 


p(E)(t) = G(a(z), ¢(¢)) 
for all €¢ & and te T? The answer is again no, but the argument is more 
difficult. (The author is indebted to Kenneth Kunen for the suggestion of the 
counter-example.) In terms of the correspondng @ the problem is to find G 
and « such that 


P(t, (t)) = Gla(t), é(¢)) 
for all €¢ & and teT. This simplifies to 
P(t, a) = G(a(t), a) 


for allae Rand t eT. We have only to choose a very bad @ to get into trou- 
ble. In view of the universal properties of N%, we can choose © so that the 
functions w, :.R - R where 


Wa) = O(t, a) 


give us all continuous functions from reals into reals. That is, ® would be 
a universal function for all real continuous functions with the parameter in 
T = N*. That is possible; but, by the equation for G above, it would also 
be universal with parameter in R. That is not possible. 

We have just shown that in the model not every function need be ordinary 
or even quasi-ordinary (i.e. ordinary in a parameter «.) There is a common 
special case where we can, however, conclude that the function is indeed 
ordinary. Suppose @ has been introduced to validate 


Vx A(x, p(x)) 


where Vxd!y A(x, y) is valid and A(x, y) has no additional parameters. (The 
formula can involve <, +,:°, other ordinary functions and predicates, the 
predicates Q and D, and so on.) We note first that must be somehow in- 
variantly determined. 

Now we ask: if what sense is @ invariant? Well, let t : T > T be any auto- 
homeomorphism of T. This extends to the model. For €¢ & we define 
t(€) so that 


a()(2(2)) = S(¢) 


for all te T. (Thus t(€) = 077") This makes it possible to regard 1: Z > 
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— #. However, it is not the case in general that t € #*; for we have 


[<(€) = t(m)] = 2(E = nJ). 


(Recall that t is naturally defined on open subsets of 7 also!) We can re- 
gard t as acting on &”, nevertheless. For any y ¢ 2” we define t() so that 


a(W)(x(6)) = tH (E)) 


for all € € &. It is easy to check that t(w) e #. Having done this, we remark 
that by the usual kind or argument, if B is any formula involving ordinary 
functions and predicates bound real and function variables, and parameters 
from & and &, then 


(IBC, «Us -. ) = [BGO Ws. I 


Applying this automorphism principle to A we have 
a(LACE, m)]) = [AGE), e))] 
because there are no other parameters. But 


[ACE, n)] = [e(2) = a]. 


hence 
[t(p)(c(E)) = t(n)] = Le(e(2) = eI. 


It follows at once that t(~) = @. 

We have thus proved that if o e &” is defined by a parameterless formula, 
then it is invariant under all automorphisms of the model determined by 
autohomeomorphisms of 7. Let us see what this means about the corres- 
ponding continuous function @: Tx R > R. Now 


P(E)(1) = F(t, E(), 


so by substitution 


(x(2))(c(2)) 


P(x(1), t(5)(4())) 
P(2(1), (t)). 


I 


But <(~) = ¢, so we derive 


p(2(E))(x(t)) = t()(t(E))(2(2)) 
= 1(p(E))(c(t)) 
p(E)(2). 
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Putting two and two together we find: 
P(t, £(t)) = P(z(2), E(2)). 


Since € is arbitrary we can say 


. 


(t,a) = (7(t), a) 


for all te T and all ae R. But hold, the autohomeomorphism t is arbitrary, 
and T = NN has a transitive autohomeomorphism group; thus 


P(t, a) = Dt’, a) 


for all ¢, t’e T. This means that © does not depend on ¢ at all and we can 
write 
@(t,a) = F(a) 


where Ff: R > R so obtained is continuous. Thus 


e(E)(t) = FE), 


and we have finally proved that @ is just an ordinary function. 

In words we could say that the above argument suggests that in intuitionis- 
tic analysis it is impossible to give an outright extensional definition of any 
function not already known from classical analysis. (Of course, the simple- 
minded identification of intuitionistic analysis with the theory of this model 
is not justified because the model is defined classically.) 

We close this section with a problem about continuity. In (7.6) the hypo- 
thesis means that the property A(x) decomposes the continuum into two 
disjoint parts. Let us weaken this by having two properties A(x) and B(x) 
where Vx[A(x) v B(x)], but where we do not make any disjointness assump- 
tion. (For example Vx[x < 1vx > O]is valid.) We can obviously not hope 
to have any such strong conclusion as [Vx A(x) v Vx B(x)] (by the example!), 
but what about this principle: 


Vx[A(x) v B(x)] > 3q, r[q < ra[Vxe [q, r] A(x) v Vx €[q, r] B(x)]]? 


The author has not yet been able to see an answer to the question of the 
validity of this formula. The situation may become clearer after a study of 
second-order arithmetic in the model. 
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SECTION D 


PROOF THEORY 
OF INTUITIONISM 


SOME RESULTS FOR INTUITIONISTIC LOGIC WITH SECOND 
ORDER QUANTIFICATION RULES 


DAG PRAWITZ 


Consider the second order logical system obtained by adding ordinary quan- 
tification rules for predicate variables (including an abstraction principle) 
to first order intuitionistic logic. For this system, I shall prove a Gentzen 
Hauptsatz (i.e. a cut elimination theorem), which will give the following 
corollaries: 

(1). If Av B is provable, then either A or Bis provable. 

(2). If 3xA(x) is provable, then A(t) is provable for some individual term ¢. 

(3). If 3X"A(X") is provable, then A(7") is provable for some predicate 

term 7” (7” is here an abstraction term defining an n-ary relation). 
These results, which certainly are consonant with intuitionistic principles, 
may have some bearing on the debated question whether a second order sy- 
stem of the described kind is intuitionistic acceptable.* 

A Gentzen Hauptsatz for classical second order logic was obtained by 
Tait [7] and later by Takahashi [8] and Prawitz [5] and [6]. Unlike the 
situation in first order logic, where a syntactical proof of Gentzen’s Haupt- 
satz for classical logic can be carried over almost without change to in- 
tuitionistic logic, these results have no immediate impact on the system that 
we are considering here. All the proofs of Gentzen’s Hauptsatz for classical 
second order logic are semantical, and to give an analogous proof for our 
present second order system, we have to supply the system with some suit- 
able interpretation. This can be done by extending either Beth’s [1] (as 
modified by Dyson and Kreisel [2]) or Kripke’s [3] interpretation of first 
order intuitionistic logic. It turns out that second order Beth models may 
be construed as special cases of second order Kripke models. We thus obtain 


* The intuitionistic significance of the described system has been advocated by Professor 
Georg Kreisel especially. I am grateful to him for encouraging me to carry out these in- 
vestigations. 


aS) 
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a stronger result by proving the completeness of our system with respect 
to Beth models. I shall now carry out this task following the pattern of 
Prawitz [5] and [6]. 


1. The languages. We consider languages containing at least individual 
parameters and variables (denoted ‘a’ and ‘x’ respectively), n-ary predicate 
parameters and variables for each n = 0,1, 2,... (denoted ‘P” and ‘X™ 
respectively, sometimes leaving out the superscript), the implication sign 
(>), the universal quantifier (V), the abstraction operator (A), and paren- 
thesis. In addition, the languages may contain n-ary function symbols for 
functions from the individuals to the individuals. Each category of param- 
eters and variables is supposed to be denumerably infinite unless otherwise 
stated. We assume that the parameters of each category are given in a cer- 
tain alphabetic order. 

The individual terms (denoted ‘t’), n-ary predicate terms (abstractions) 
(denoted ‘T”) and formulas (denoted ‘A’, ‘B’, ...) are defined as usual 
(see e.g. Prawitz [4], p. 63-64); it is to be noted that terms and formulas 
never contain free variables. We shall assume that the terms of each catagory 
have been ordered in some way. 

Negation, conjunction, disjunction, and existential quantification are de- 
fined as in Prawitz [4], p. 67. We shall also use some other ordinary syntac- 
tical abbreviations and notations, which can all be found in Prawitz [4]. 


2. The calculus 7. By a sequent in a language L we understand an expres- 
sion of the form Ir > A, where I and A are (possibly empty) finite strings of 
formulas in L separated by commas. 

If IT and & are sets of formulas, 7 = Z is to denote any sequence whose 
formulas occuring in the antecedent (succedent) are exactly those of IT (2). 
Usual abbreviations are used, e.g. A, 1] = B is an abbreviation for {4} U7 
=> JB 

We now set up the following caiculus of sequents called 7: 

Axioms. All sequents of the form 4, IT => A. 
Inference rules. 


>>) A, II=>B >) SAB. =e 
H=>ADB Al Sei, =C 
—V,) II>A Vi-) At, =>B 


VAL VxA, T=>B 
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—V,) H=>A ee) Ax, 1=>B 
x Ae VX"A, T=>B 
A) a(S" A) A>) (Sint, "A), HB 
= ee el eet Pee ea Al io. 1 ll pb 


cut) Tr=A A,T=>B 
Tr=B 


The rules > V, and >V, are with the restriction that a and P”, respectively, 
do not occur in the sequent below the line. 

Proofs (in tree form) and provability are defined as usual. 

Remark. It is easily seen that structural rules that have not been stated 
above (e.g. thinning) hold as derived rules. It is also to be noted that the 
ordinary inference rules for negation, conjunction, disjunction, and existen- 
tial quantification hold as derived rules by the way we have defined these 
concepts. 

A sequent containing at most one formula in the succendent will be 
called an intuitionistic sequent. All sequents provable in the calculus 7 
above are of course intuitionistic. However, to prove that the cut rule 
is derivable from the other rules of 7, it will be convenient to consider 
another calculus 7’ where also other sequents may be provable. 


3. The calculus 7’. The calculus 7’ is set up as follows: 
Axioms. All sequents of the form A, IT = 2, A, where A is atomic. 
Inference rules. 


+>) A, {=>B >) Mle Al 1B, = 
=>%,, Ass Al) 153, MES, 

V1) II=>A Vi) Anis 

I=, Vx A‘, Vx Aa i> 
—V,) EH=+A ia) ees, 

Ht, VX"Ake VX"A, T= 
=A) Mad, (Si, A) As) (Sherin’), Td 

UD Aa energie. t, Aol its 2. . t,, Td 


cut) =>), A A, lif 
i, 
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The rules +V, and -V, are with the same restrictions as in 7. 

One easily shows (by induction on the length of the given proof) that a 
sequent I > A is provable in.7’ only if there is a formula A in A such that 
I > A is provable in 7’. From this, one easily obtains: 


LEMMA 1. An intuitionistic sequent, i.e. of the form I — A, is provable 
in F (without cut) if and only if it is provable in J’ (without cut). 

To prove that the cut rule is derivable from the other rules of the calculus 
7, it is thus sufficient to prove the corresponding thing for the calculus 7’. 


4, Beth models. The (second order) Beth models will be certain trees of 
formulas paired with a language. By an (undecorated) tree we understand 
a pair (K, R) where K is a non-empty set of points or nodes and such that 
there is a relation S in K and a point k € K (called the origin) with the prop- 
erties: (i) R is the ancestral of SS, (ii) there is no k’ in K such that k’Sk, 
(iii) to each k’ in K except k, there is a unique k” such that k’’Sk, (iv) for 
each k’ in K, kRk’. The relation S just described may be read ‘stands im- 
mediately below’. By a path in a tree (K, R), we understand a sequence ky, 
k,,... of points in K such that k, is the origin, k; stands immediately below 
k;,, (for each k; and k;,, in the sequence), and k, is the last point of the 
sequence only ifitis a top node (i.e., there is no k’ in K different from k, such 
that k,Rk’). A path is said to go through k if k belongs to the path. A point 
that belongs to a path is said to be on the path. 

A Beth model is now defined as a quadruple (L, K, R, ~) where L is a 
language, (K, R) is a tree, and ¢ is a function that assigns a set », of atomic 
formulas in L to each k in K, satisfying the following condition: for each 
point k in K, if A does not belongto ¢,, then thereis a path in (K, R) through 
k such that A does not belong to 9,., for any k’ on the path. 

Let -/@ be a given Beth model (L, K, R, @), let A be a formula in L, and 
let kK be a point in K. By definition, A is true (in /) at the point k, abbre- 
viated S(A, k) (or S,(A, k)), if and only if either 


(1). Ais atomic and A belongs to @,, or 

(2). Ais B > C, and for each k’ such that kRk’, if S(B, k’), then S(C, k’), 
or 

(3). A is Vx8 and for cach 7 mE S( 8: ek eon 

(4). A is VX"B and for each P" in L, S(B}n, k), or 

(3). A is Ay Xs... X, Bite eels (Sp eae ic) 


A sequent If = 2 in L is true (in # = (L, K, R, ¢)) at k, if for each k’ 


D XVII SECOND ORDER INTUITIONISTIC LOGIC 263 


in K such that k Rk’, if all formulas of H are true at k’, then some formula of 
% is true at k’. 
One easily proves by induction over the length of the formulas: 


Lema 2. If A is not true in-M = (L, K, R, o) at k, then there is some path 
in (K, R) through k such that A is not true in at any point on the path. 

By the denotation (in a Beth model.@ = (L, K, R, ¢)) of a predicate term 
T" in Lata point k in K, abbreviated d(7”, k) (or d(T”, k)), we understand 
(i) the set of n-tuples (t,, ¢2,..., ¢,) such that T"t,t,...4, is true (in-@) 
at k, when n > 0, and (ii) when » = 0, the truth value 1 or 0 if T” is true or 
not true, respectively, (in.#) at k. 

One easily verifies: 

Substitutivity of equivalence. Let “ = (L, K, R, g) be a Beth model, let 
A be a formula in L, let T and T’ be n-ary terms in L such that d(T, k) = 
= d(T", k) for each k in K, and let A’ be obtained from A by replacing 
some occurences of T with 7’. Then, for each k in K, A is true in-# at k if 
and only if A’ is. 

A Beth model.# = (L, K, R, @) is said to be normal if for each T” in L 
there is a parameter P” in L such that d_4(7", k) = d.4(P", k) for each k in K. 

A formula or sequent in L is said to be valid if it is true in each normal 
Beth model (L, K, R, @) at each point k in K. 

By the definition of truth and the substitutivity of equivalence we obtain: 


Lema 3. If @ = (L, K, R, o) is a normal Beth model, then VX"A is true in 
M at k, if and only if Aj, is true in Ml at k for each T" in L. 


Using lemma 3, one proves by induction over the length of proofs: 


THEOREM 1. Every sequent provable in Z or J" is valid. 


Remark 1. A somewhat more natural procedure would be to replace the 
language L in a Beth model with a sequent of domains D°, D', D’,..., 
where D® is a set of individuals and D” is a set of n-ary relations in D°, and 
then to define satisfaction in the usual way. However, the definition given 
above (which of course amounts to the same thing) will be convenient in the 
subsequent completeness proof. 

Remark 2. A (second order) Kripke model would be defined as a quadruple 
(K, R, 9, W), where K, R, and @ are as before, and w is a function that assigns 
a language (or a sequence of domains as in remark 1) to each point in K, and 
where the following two conditions are satisfied: for each k and k’ in K 
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if KRk’, then p(k) S o(k’) and W(k) S W(K’). Unlike the situation in the 
first order case, the second order Beth models are thus a special kind of the 
second order Kripke models. 


5. Refutations. The points in a tree may be divided into levels in an obvious 
way, and the levels may then be numbered in such a way that the origin is 
the point on level 1. By a refutation of a sequent S, we understand a language 
L paired with an infinite tree of sequents (i.e. a quadruple (L, K, R, ) where 
(K, R) is a tree and assigns a sequent to each point in K), where L is a lan- 
guage which contains infinitely many parameters of each kind besides the 
symbols of S$, and where the following conditions are satisfied: 
(1) S is the bottom-sequent (i.e. assigned to the origin). 
(2). No sequent (assigned to a point) in the tree is an axiom. 
(3). If the sequent S, = I > A, A is a sequent on an odd level (i.e. assigned 
to a point on an odd level), then there is one sequent S, or two sequents 
S, and S; immediately above S, (i.e. assigned to point(s) k, (and k3) such 
that k, stands immediately below k, (and k3)) as follows: 
(a). If A is atomic, then S, is ! > A, A and there is no S3. 
(b). If Ais B > C, then S, isl! > Aand S;is I’, B > A, C. 
(c). If A is VxB, then S, is ! > A and S3 is F > Bz where a is the first 
individual parameter in Z not in S,. 
(d). If A is VX"B, then S, is F - A and S; is [ > Bj}, where P” is the 
first n-ary predicate parameter in LZ not in S,. 
(ce). If A is Ax, %>...%, Bio f thenes is AS: es) eon 
there is no S3. 
(4). If the sequent S, = A, I - A is a sequent on an even level, then there 
is one sequent S, immediately above S, as follows: 
(a). If A is atomic, then S, is, A > A. 
(b). If Ais B > Ci then S, is), 2 = G = Anson © — 24. 
(c). If A is VxB, then S, is ', B, VxB— A where ¢ is the first individual 
parameter in Z such that B* does not occur in the antecedent of a sequent 
below S.. 
(d). If A is VX"B, then S, is r, BY., VX"B — A where 7” is the first 
n-ary predicate term in L such that B%, does not occur in the antecedent 
of a sequent below S). 
(ec). If A is AR x, Bit, ee ticu ier (5,0) oe 
A refutation of a sequent S is uniquely determined except for L and 
the choice to be made in (4b). By comparing the inference rules in 7’ 
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with the clauses (3) and (4), it is easily seen that if the sequent(s) S, (and 
S3) stands (stand) immediately above S, and is not determined by clause 
(4b), then if S, is unprovable in 7’ without the cut rule so is (are) S, (and 
S3). Furthermore, if B > C, I — A is unprovable in 7’ (without cut), then 
[',B>C- A, B and [,C > A cannot both be provable in 7" (without 
cut). Hence, if S is unprovable in 7’ without the cut rule, we can construct 
a refutation of S by choosing an unprovable S, in clause (4b). We have thus 
proved 


LEMMA 4. There is a refutation of each sequent which is unprovable in 7’ 
without the cut rule. 


Note that the choice to be made at applications of clause (4b) is not ef- 
fective in general; lemma 4 is thus proved only classically. 

Given a refutation (L, K, R, y) and a point k in K, we define: 

A is positive at k if and only if on each path in (K, R) through &, there is a 
point &’ such that A occurs in the antecendent of the sequent y,-. 

A is negative at k if and only if there is some point k’ such that KRk’ and 
A occurs in the succedent of y,.. 

From the definition of a refutation (LZ, K, R, y), we verify the following 
facts: 


LEMMA 5. (a). If A is atomic and negative at k, then there is a path through 
k such that A is negative at each point on the path. 
(b) If A is atomic and positive at k, then A is not negative at k. 
(c)(1). If A > B is positive at k, then for each point k’ such that kKRk’, 
either A is negative at k' or B is positive at k’. 
(2). If VxA is positive at k, then Af is positive at k for each t. 
(3). If VX"A is positive at k, than A%, is positive at k for each T". 
(ye At tt, 1s posuive at k, then(S,\ 7)" A)is positive 
Ql. 
(d)(1). If A > B is negative at k, then for some k' such that kRk’, A is 
positive at k’ and B is negative at k’. 
(2). If VxA is negative at k, then A; is negative at k for some t. 
(3). If VX"A is negative at k, then Af» is negative at k for some T”. 
(Aro dixie eta et, ts) egative at k, then (S;i-,°"A) is 
negative at k. 
Proors. All the proofs are quite straightforward with the exception of 
case (cl), which is obtained as follows: Assume that A > B is positive at k 
and that kRk’. We consider two cases: 
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Case (i): A > Bis positive at k’ with respect to the subtree whose origin 
is kK’. Then for each path through k’ there is a point k”’ such that k’Rk”’ and 
such that the sequent assigned to k’’ is obtained by applying case (4b) to 
the formula A > B. There are now two possible cases: either all of these 
applications of (4b) are obtained by writing B in the antecedent in which 
case Bis positive at k’, or some of these applications of (4b) are obtained by 
writing A in the succedent in which case A is negative at k’. 

Case (ii): A > Bis not positive at k’ with respect to the subtree whose ori- 
gin is k’, Then there must be some point k”’ such that k’’Rk’ and such that 
the sequent assigned to k’’ is obtained by applying the case (4b) to A> B 
writing B in the antecedent. Hence, in this case, B is positive at k’. 


6. Main results 


THEOREM 2. Every valid sequent is provable in" without the cut rule. 

Proor. Let © > A be a sequent that is not provable in 7’ without the cut 
rule, and let Z = (L, K, R, w) bea refutation of f > A (lemma 4). 

By a possible value of a term 7”, n > 0, (with respect to #) we understand 
a function V on K such that for each k in K 

(ll). V(R) 1s @ sctiot n-tuples (7a: eee) 

(2). if Tijio. 207, is positiverat k, theni(¢;, gee ie Yo): 

(3). if 71, t, ... 7, 1s megative atk, them (7? metas Fe): 

(4). if (¢,, f2,...,4,) € V(K), then there is a path in (K, R) through k 

such that (7,, f2,...,¢,) € V(k’) for each k’ on the path. 


By a possible value of a term T°, (with respect to #), we understand a func- 
tion V on K such that for each k in K: 
(1). V(k) is one of the two truth values 1 (truth) and 0 (falsehood); 
(2). if T° is positive at k, then V(k) = 1; 
(3). if T° is negative at k, then V(k) = 0; 
(4). if V(k) = 0, then there is a path in (K, R) through k such that 
V(k') = 0 for each k’ on the path. 


For each 7” and for each possible value V of T", we introduce a new con- 
stant Py not in Z in such a way that P/ is different from Py., if V4 V’. The 
language obtained from L by adding all these new parameters is denoted by 
L’. If Ais a formula in L’, then A* is to be a formula obtained from A by re- 
placing every one of the new constants P? with a term Jin L that has Vasa 
possible value. 
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Let-@ be the quadruple (L’, K, R, g) where L’, K and R are as above and 
g is the function from K such that ¢, (k in K) is the set containing (i) all 
atomic formulas positive in & at k, (ii) all formulas Py such that V(k) = 1, 
and (iii) all sentences Pyt,t,... 4, such that (¢,, f2,..., f,) belongs to V(k). 
We want to prove: 

(a). @ is a Beth model. 

(b). If A* is positive in Z at k, then A is true in-# at k. 

(c). If A* is negative in # at k, then A is not true in-# at k. 

(d). @ is a normal Beth model. 


The theorem may then be inferred as follows: Since the formulas of F (4) 
are positive (negative) in Z at the origin of the tree (K, R), they are thus true 
(not true) in -@ at this origin (note that A* = A, if A is a formula in L). 
It follows that [ > A is not true in the normal Beth model .#. Hence, [ > A 
is not valid. 


Proof of (a). Follows from lemma 5(a) and clause (4) in the definition of 
possible value. 


Proof of (b) and (c). The proof is by induction over the length of the for- 
mula A. If A is atomic and is a formula in L, then (b) is obvious from the con- 
struction of g, and (c) follows from this construction and lemma 5(b). 
If Ais P? or Ppt, t, ...t,, then (b) and (c) are obvious from the construction 
of g and clauses (2) and (3) in the definition of possible value. 

The induction step is obvious by comparing lemma 5(c) and 5(d) with the 
definition of truth. We take one example: Suppose (VX"B)* is positive at 
k. Since (VX"B)* = VX"B*, it follows from lemma 5(c3) that B*f,. is positive 
at k for each T” in L. For any P” in L’, (Bp.)* = B*}, for some term 7” in 
L. Hence, for each P" in L’, (B}n)* is positive at k. By the induction assump- 
tion, B>, is then true in-# at k for each P"in L’, i.e. by clause (4) in the de- 
finition of truth, VX”"B is true in-# at k. 

Proof of (d). We first verify that for each term 7, d.,(T, k) (considered 
as a function on K) is a possible value of T*. Condition (1) is obviously 
satisfied; conditions (2) and (3) are satisfied according to (b) and (c), re- 
spectively; and condition (4) is satisfied according to lemma 2. It follows 
that there is a parameter Py, in L’ such that d,(7, K) = d4( Py, k) for each 
k in K. 


THEOREM 3. Jf a sequent is provable in FZ or J", it is provable without using 
the cut rule. 
Proor. Assume that [ > 4, A and A, — A are provable in J or 7’. 
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By theorem 1, Fr > A, A and A, Fr > A are then valid. It follows that Fl — A 
is valid, and hence, by theorem 2 (and lemma 1), also provable in 7’ (7 ) 
without using the cut rule. 


THEOREM 4. (a). If >A Vv B is provable, then either +A or —B is provable. 
(b). If +4xA is provable, then so is Af for some t. 
(c). If +4X"4 is provable, then so is Ajn for some T”. 

Proor. We prove part (a) and (c) of the theorem for the calculus 7. 
The proof of part (b) is similar to that of part (c). 

Proof of (a). Assume that >A v B, i.e. 


~VX((A > X) > ((B > X) > X)) 


(where X is a O-ary variable not occuring in A or in B) is provable. It follows 
(by the fact that the rules +V and +> are inversible) that 


Al =) 12, 1 SS) IP IP 
is provable, where P may be choosen foreign to A and B. Hence, either 
AD>P,B>P->A or ADP,BDBUP-B 


is provable. Since P does not occur in A or B, we obtain by substituting 
AY B for P that, either 


A>(AvVB),B>(AvB)>~A or A2>(AvVB),B>(AvVB)>B 


is provable. Since >A > (Av B)and >B > (Av B)are provable, it follows 
that either >A or —B is provable. 
Proof of (c). Assume that >1X"4A, i.e. 


3VY(VX"A > Y) > Y) 


(where Y is 0-ary variable not occuring in VX"A) is provable. As in the proof 
of (a), we can infer that for some P not occuring in VX"A and for some n-ary 
terms 1,, 1>,....." 


VX"(AS P), A, ee eee 
(where 1 < i < m) is provable. Since sequents of the form 
3VX"A >4X"A) and >A} > 4K" 


are provable, we infer by substituting JX¥"A for P that >A%’ is provable 
for some n-ary term T. 
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Remark. Theorem 4 also holds when the antecedent is not empty as long 
as there is no predicate variable in an atomic and strictly positive part of 
a formula in the antecedent. The strictly positive parts of a formula A are 
defined recursively as follows: 

(1). A is a strictly positive part of A. 

(2). If C > B, VxB, or VX"B is a strictly positive part of A, then so is B. 

(3). If Ax, x,...%x,Bt,t,...1, is a strictly positive part of A, then so is 

(Se eemeo) 
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ON CUT ELIMINATION IN INTUITIONISTIC SYSTEMS 
OF ANALYSIS 


B. SCARPELLINI 


In what follows we consider certain intuitionistic systems of analysis, for- 
mulated in terms of the sequential calculus. Our aim is to outline proofs 
of certain results, in part on cut elimination, about these systems. 


1. We start by giving a quick review of Gentzen’s second consistency proof. 
As language L we take one whose alphabet contains the following kinds of 
symbols: 
(1). logical symbols a, v, >, 4, V, 
(2). brackets (, ), 
(3). the individual constant 0, 
(4). constants for certain primitive recursive (p.r.) functions such as 
+, x,’, etc., 
(5). the equality sign =, 
(6). individual variables x,,x2,..., x, y, etc., 
(7). function variables for one place number theoretic functions o,, 
Woe y eles 
(8). for every finite sequence w = <uo,...,u,> of natural numbers a 
denumerable list of so called special function constants «),, Goer. 
(9). the sequential arrow —>. 
Terms, prime formulas and arbitrary formulas are defined as usual inductive 
ly. An expression W,,..., %, + %,,..., 8B, where M;, B, are formulas is 
called a sequent. As sublanguage L* of L we take the one whose alphabet 
contains only the symbols listed under (1)-(6) and (9). Formulas and se 
quents are then defined with respect to this alphabet. The rest of section | 
refers only to L* as basic language. The terms 0, 0’, 0”,... are called nu- 
merals and correspond as usual to the natural numbers. A sequent is called 
a prime sequent if it contains only formulas of the form p=q, that is prime 
formulas. At this stage let us associate once and for all with every n-place 
eal 
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constant c for p.r. functions a fixed p.r. function ~, of n arguments; in par- 
ticular we associate with +, x,’, addition, multiplication and successor, in 
that order. This valuation induces in a natural way a mapping A which 
associates with every term t containing precisely n variables a p.r. function 
h, of n arguments. In particular ifn = 0 then 4, is a number which will also 
be denoted by |t|. Now we can call a constant prime sequent ¢rue if it is true 
under the usual interpretation. A prime sequent is called verifiable if it is 
true whenever we replace the free variables by constant terms. Finally let 
us call two formulas 2, 2’ isomorphic (to each other) if there is a formula 
B(x,,..-, x,) and two lists of constant terms t,,..., t, and q,,..., q, such 
that 

(1). Wis SU eee 

(2). W is Br,» 45) 

(3). Iq = lt] for i < s. 
Now we can introduce the sequential version ZT of classical number theory. 
The rules of ZT are given by the following list: 

(1). the rules of the first order sequential predicate calculus, 

(2). induction, 

(3). a special rule called conversion, defined as follows: 


Ney lee a 


Of oe 


where %,, 2; and B,, 8, are isomorphic. 
The axioms of ZT are roughly speaking the following: 
(1). all numerically true prime sequents with at most one formula in the 
succedent, 
(2). a sufficiently large set of verifiable sequents containing among others 
all axioms of equality, 
(3). all sequents of the form D > D’ with D, D’ isomorphic. 
Having the rules and the axioms we can define proofs in the usual way. 
Proofs will always be considered as trees whose nodes are sequents. A clas- 
sical proof F is intuitionistic if it has the property: A contains only sequents 
having at most one formula in the succedent. If we admit only intuitionistic 
proofs then we obtain the intuitionistic version of ZT, to be denoted by ZTi. 
Before looking at Gentzen’s second consistency proof let us repeat the 
definition of the final part of a proof W: a sequent S belongs to the final 
part of a proof if it is either the end-sequent, or if it is a premise of a conver- 
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sion, or a structural rule whose conclusion belongs already to the final 
part. We also use the self-explaining notions of ‘successor’ and ‘image’ 
for formulas in the final part. Definititions of this or similar notions are 
given in [1], [4], [6]. In order to prove consistency Gentzen introduced in 
{1] certain syntactical transformations of proofs, called reduction steps. 
There are three kinds of these, namely 

(1). elimination of logical axioms and of thinnings from the final part, 

(2). elimination of logical inferences from the final part, 

(3). elimination of induction inferences from the final part. 
Reduction steps of type (1) will be referred to as preliminary reduction 
steps, while steps of type (2) and (3) are called essential reduction steps. 
The precise definition of these reduction steps (or simply reductions) is 
not needed. It is sufficient to know a few of their properties. 

The proof of the following theorem is elementary and is given in [1]. 


THEOREM 1. (a). Only finitely many successive preliminary reduction steps 
can be applied to a given proof F. 

(b). Let F be a proof whose final part contains no variables and which 
does not admit any reduction step. If F is different from its final part then there 
is at least one logical inference whose conclusion is in the final part and whose 
principal formula has an image in the end-sequent. 


If F’ is obtained from FY by means of a series of preliminary reductions 

and one essential reduction then we denote this fact symbolically by A’'VF. 

Then we define: P’V* F iff there are proofs P),..., A, such that Py = F, 

Zao sande yy @ ton: <5) itis clear that V~ 1s transitive. 
Gentzen’s main result is 


THEOREM 2. The relation V* is well founded. 


Gentzen proved theorem 2 by associating with every proof F an ordinal 
g(P) and by showing that A’VF implies g(#’) < g(P). 

In order to state an immediate consequence of theorems 1 and 2 we recall 
the notion of a constructive, infinite cut free proof, introduced in [1]. By 
transfinite induction over V*, making thereby use of theorems 1-2 one pro- 
ves easily 


THEOREM 3. If F is a proof in ZT of S then there exists a constructive in- 
finite cut free proof F’ of S. If F is intuitionistic then F’ is intuitionistic. 


Gentzen proved his results for a subsystem of ZT which does not contain 
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conversion or implication. However, it is easy to extend his procedure so 
as to include ZT; details concerning this are given in [6]. It is also not dif- 
ficult to extend Gentzen’s method so as to include the intuitionistic system 
ZTi; this is also treated in [6]. 

Another immediate consequence of theorems 1-2 for ZTi, which is prov- 
ed in [6] is 


THEOREM 4. (a). If F is an intuitionistic proof of > Av¥B, where A,B 
contain no free variables, then one can find effectively a proof F’ of > X or 
> %. 

(b). If F is an intuitionistic proof of > IxX(x) where A contains only x 
free, then one can find effectively a term t and an intuitionistic proof P' of 


> X(t). 


2. Next we look at weak extensions of ZT for which Gentzen’s method still 
works. Let D(x) be a p.r. predicate and < a p.r. partial ordering of 
{x|D(x)}; assume that < is well founded. Let g and 0 be two constants in 
ZT which express < and D numeralwise. For easier reading we abbreviate 
a(x, y) = 0 by x < y without danger of confusion and for d(x) = 0 we 
write just d(x). Assume for simplicity that ZT contains already the following 
axioms: 

(1). p<q,q<xt—p <t, 

(2)e 0 ies 

(3). P<q@p=q>, 

(4). p<q > d(p), 

(5). p<q > d(q), 
where p, q, t are terms. By ZT(<) we denote that extension of ZT which 
has the following additional rules: 


d(y), (x)\(¥ < y > Mx), F > A, Uy) 
TI SSE eS 
d(t), P > A, A(t) 


and for every t with D(|t|) true the following rule 


y <b, (x(x < y > U(x), F > A, Uy) 
Thy 90 Sa 
p <tr 4, Up) 


where y does not occur in the conclusions and where t in the fist case and p 
in the second are supposed to be free for y in 2[(y). The two rules are denoted 
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by TI and TI, respectively (with |t] = 7) and have already been introduced 
in this form in [2]; they express adequately transfinite induction with 
respect to < and {<x,y>) |x~<n Ay<nax<y}. The rules TI, are 
superfluous but have been added for technical reasons as will be seen below. 

To the reduction steps already used by Gentzen we add new ones, namely 
those for TJ- and TI,-inferences with conclusion in the final part. Let FY be 
a proof containing no variables in the final part. Let there be a particular 
TI-inference having its conclusion in the final part. 

Case 1. D(|t|) is false. Then d(t)— is an axiom from which d(t), [> A, 
Y(t) can be derived by thinning and interchange. 

Case 2. D(|t|) is true. Now y < t > d(y) is an axiom and so we can per- 
form the new detivation presented below. Here we have suppressed I, A in 
order to save space, and abbreviated (x)(x < y > A(x)) by (x) <A. 


y<tody) dy), (x)-,A% > A) 


cut 
y =~ i, (x)<, U a A(y) 
Thy Stn 
8 <t— M8) 
->D Ss 
—-8<{>d (8) 
—3V —— 
Se! (x)~, 2, d(t) > M(t) 
cut 


d(t) > 2(4) 


The reduction step for TJ, looks quite similar: 

Case 1. |p| K |t|. Then p < t > is an axiom from which p <t, fF > 4, 
Y(p) can be derived by thinning and interchange. 

Case 2. |p| < |t|. Now y < p, p < t = y < t is an axiom and so we can 
perform the new derivation presented below. There we observe the same con- 
ventions as before. 


y<pp<toy<t y<t(x).,% > Ay) 
cut SS 
y <a ee)e, ay <t— Uy) 
i i a == 
8<p,p <t— A(8) 
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p<t-8<p> AG) 
pxt>(x)_A  (x)<, Lp <t— Alp) 
cut I i i ia. =. 


p<t— Up) 


With these reduction steps (called TI- and TI,-reductions for short) added 
to the previous ones it is an easy task to prove the analogue of theorem 1 in 
an elementary way. Gentzen’s program is completed if we can prove theo- 
rem 2 also for the present case. Let us indicate briefly how this is achieved. 
In this context we use the notions and notations used in [1] and [7]. First 
we associate with every premise S of a TI- or a TI,-inference the number of 
logical symbols in (x)~,2{ and call it the order of S. If Sis the premise of a 
cut or an induction then we take as the order of S the number of logical sym- 
bols in the cut formula or the induction formula respectively. Now we can, 
as in [i] and [7] associate with every sequent S in a proof Ff another num- 
ber, called its height and to be denoted by A(S). Finally let € be the ordinal 
associated with < and €, the ordinal associated with {<x, y>|x<nay<na 
x<y} where D(n) is true. Let S be the premise of a TI-inference and 
S' its conclusion. If « is the ordinal of S then we take w,((« + w*)w*) 
as ordinal of S’ where d = A(S)—A(S’). If S is the premise of a TI-inference 
then we take w,((« + w*)c*) as ordinal of S’ where 4 = €,. The verifica- 
tion that a TI- or TI,-reduction lowers the ordinal of (the end-sequent of) 
the proof FY requires a routine calculation which is omitted. For the pre- 
vious reduction steps everything remains the same as before and so we obtain 
in a Straightforward way the analogue of theorem 2 for ZT(<). With theo- 
rems 1-2 at our disposal we can prove theorem 3 by transfinite induction over 
V* where V* is defined as in section 1 but with respect to the reduction steps 
of ZT(<). In the same way we can now also prove theorem 4 proceeding 
exactly as in [6]. 


3. Before coming to the main part of this paper we consider a conservative 
extension ZT*(<) of ZT(<). Formulas and sequents of ZT*(<) are now 
defined with respect to L. While the valuation of constants for p.r. functions 
is retained, we can explain the role of the special function constants as fol- 
lows: if w = <uo,...,U,-,> then a!, represents a function whose first s 
values are determined by w but which is unspecified otherwise. This leads 
us to consider a term such as e.g. t(a, «!,, x) (containing no other variables 
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and special function constants) as representing a certain continuous func- 
tional t(a, f, x) where « may run over arbitrary functions while f is restrict- 
ed to those functions for which f(i) = u; for 0 < i < s—1. Let us call a 
term saturated if (1) it contains no variables but possibly special function 
Constants, 9s -,¢,,.9(2) if theisequences w,,-..,w, determine the nu- 
merical value of t completely. For a saturated term we denote by |t| its nu- 
merical value. The notion ‘2{ is isomorphic with 2” is defined as in section 1 
except that the terms ft, q are now only required to be saturated. A prime 
sequent is called saturated if it contains only prime formulas t = q with t, q 
saturated. A saturated prime sequent is called numerically true if it is true 
under the usual interpretation. Verifiable sequents are defined as before. We 
now take as axioms for ZT*(<) all those of ZT(<) (with ’isomorphic’ and 
‘numerically true’ in the new sense) plus for every special function constant 
at, with w = <uo,....Us—,> the axioms — ai,(k) = u, for k < s—1. The 
rules of ZT*(<) are essentially the same as those of ZI(<) except that now 
also quantifier rules for function symbols are available and that the conver- 
sion rule is now defined with respect to the new notion of isomorphism. 
We can now prove in exactly the same way theorems |, 2 as before. A minor 
complication related to the presence of function variables appears now in 
the proof of theorem 3; it is in this connection where the use of the a, is 
very convenient. We will meet this complication again in the proof of theo- 
rem 3* in section 4. Theorem 4 remains true under the additional assumption 
that the formulas (4x)2 and 2v%6 contain neither function variables nor 
special function constants; its proof parallels the one in [6]. 


4. Denote by ZT* the theory which we obtain from ZT*(<) by omitting 
TI, TI, (that is just classical number theory but with reference to the lan- 
guage L). Let L(y, z, x) and D(y, x) be two p.r. relations such that for every 
x € {<y, z>|L(y, z, x)} is a partial ordering of {y|D(y, x)}. Let g(x, y, z) and 
D(y, x) be terms such that L and D are formally represented by g(x, y, z) = 0 
and d(y, x) = 0. Let us abbreviate g(x, y, z) = 0 by y<,z or sometimes 
even more simply by y < z; let us abbreviate D(y, x) = 0 by d(y) or by 
d,(y). For simplicity we assume that ZT* contains all the axioms 

(I).p<qq<t>p<t, 

(Qmpeaqngespe, 

(3). p<q.p=q-, 

(4). p<q — d(p), 

(5). p <q  d(q). 
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The system ZB is obtained from ZT* by adding the following new rule: 


d,(y), (x)(x <: y > U(x), F > A, Ay) 
MPR Sa ee 
W(<,), &(q), F > 4, U(q) 


where ¢ is an arbitrary term, q a term free for y in U(y) and y a variable 
which is not free in the conclusion. W(<;,) is the formula («)(4x)(—a(x +1) 
~<,«(x)). By an appropriate choice of <, it is easy to show that ZB is 
equivalent to the classical version of the theory Tlgy in [3]. The intuitionis- 
tic version ZBi is again obtained by admitting only those proofs which con- 
tain only sequents having at most one formula in the succedent. If we en- 
quire whether it is possible to prove theorems 1-4 also for ZB (with appro- 
priate reduction steps) it turns out that this is impossible in view of a theorem 
of G. Kreisel whose proof is outlined in the appendix of section 1 of the 
Stanford report. For ZBi however we shall see that reduction steps can be 
introduced so as to obtain theorems 1, 2 and 4 and a restricted version of 
theorem 3. To this end we need a lemma which is almost trivial to prove but 
which is basic for all later considerations. We remind that a proof F in 
ZBi is a certain finite tree; the final part of P (which is defined in the same 
way as before) is a certain subtree of Z. Therefore we can arrange the upper- 
most sequents of the final part in a well-determined way from left to right: 
So>+++) S,- One easily shows: for two uppermost sequents S’, S” of the 
final part, S’ is left of S” iff there is a cut in the final part with left premise 
Sf and right premise S3, such that Sj is below or equal to S’ and Sj is below 
or equal to 5. 


(Basic) LEMMA 1. Let F be an intuitionistic proof which contains no thinning 
on the right in the final part and whose end-sequent has the form — %. Let 
So +++> 5, be the uppermost sequents of the final part listed from left to 
right, let S; be T; > Y,;. 
(1). [fi < nwe can effectively find a proof PF’ of > X,, 
(2). if B occurs in some T';,j <n, then one can find effectively a proof of 
—> B. 


REMARK. A stronger version of this lemma is given in [6], theorem 7 but 
only for ZT. The assumption that there is no thinning on the right in the final 
part of F could be dropped but helps to simplify the argument below. 
Proor. We proceed by simultaneous induction with respect to 7. If i = 0 
then (1) and (2) are satisfied by Sp which necessarily has to be of the form 
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— %,. Let (1) and (2) be true up to i-1. If 8 occurs in I’;, then there is a cut 
with left- and right-premises S’ and S” respectively whose cut formula 8* 
in S"’ is an image of this 8. Since there is no right thinning present in the final 
part of F we necessarily find an S, with « < isuch that the cut formula $* 
in S’ is an image of the A, in [, > %,. By induction there is a proof of > %f,. 
Since 8* is isomorphic both with YW, and with %, there is also a proof of 
— %. Since now — 8% is provable for every 8 in I; and since a proof of 
I’; > YU, is already at hand there is also a proof of > Y;, thus verifying both 
(1) and (2) for S;. 
Next we say that a proof in ZB has degree n if every formula in Y contains 
at most » logical symbols. Denote by ZBi, the subsystem of ZBi obtained by 
admitting only proofs of degree n. Now we introduce a certain conservative 
extension of ZBi, namely ZBi’ which has all the rules and axioms of ZBi and 
in addition the following rules: 

(1). If e is the Gédel number of a proof F in ZBi of W(<;,) where t is a 
saturated term, the rule 


diy), X)& <:y > UQ)), F > UO) 


TRI e 
dq), T > X(q) 


with q free for y in 2f{(y) and y not free in the conclusion. 
(2). With e, t, q as before and for m such that D(n, t) is true, the rule 


ene ey a (x), 2) 
TRE —<_———— eee 
q <,n, F > A(q) 


A proof #’ in ZBi’ has degree n if all formulas in Y’ contain at most 47 lo- 
gical symbols and if every Gédel number e in an inference TRI, or TRI, 
is the Gddel number of a proof of degree n. Restriction to proofs of degree 
yields a subsystem ZBi, of ZBi’. It is easy to prove 


LemMaA 2. Jf ZBi, + %& then ZBi, + %. 


For each 7 we make the following assumption 4,;: if 8 is a J7}-formula such 
that ZBi, + 8 then % is true. 

REMARK. Lemma | holds of course invariably for ZBi’, ZBi, and ZBi,, as 
follows from its proof and lemma 2. By lemma 2, A, remains true if we re- 
place there ZBi, by ZBi,. Call a proof F in ZBi’, normal if all its terms in the 
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final part are saturated, if its end sequent has the form — Y, and if no 
thinning occurs in the final part. For normal proofs we define reduction steps; 
with the exception of one we have introduced all these reduction steps in sec- 
tions 1 and 2 already. Specifically we have (1) elimination of logical axioms 
and thinnings from the final part, (2) elimination of logical inferences from 
the final part, (3) elimination of induction steps from the final part. We have 
TRI,- and TRI}-reduction steps which are exact copies of the corresponding 
TI- and TI,-reduction steps introduced in section 2. Finally, if there is 
a TRI-inference 


dy), (x)-<,%, F > Uy) 


TRI 
W(<,), a(q), F > Aq) 
with conclusion in the final part, then, in view of lemmas | and 2, the remark, 


and the definition of normal proof, we find a Gédel number e of a proof in 
ZBi of W(<,). Hence we can replace the TRI-inference by a TRI,-inference: 


di(y), (x)(x <;y > W(x)), F > Uy) 
TRI 


d,(q), F > X(q) 
thinning a 


W(<,), 4(q), F > X(q) 


Reduction steps of type (1) above are again called preliminary, all other 
reduction steps are called essential. For normal proofs Y, FW’ we define 
PNF iff F’ follows from FY by means of some preliminary and one essen- 
tial reduction step. The relation V* is then defined in the same way as the V* 
in section 1. For normal proofs we can again prove theorem 1 insection 1 inan 
elementary way. In order to prove theorem 2 for our V* we have to associate 
ordinals with proofs in such a way that an essential reduction step lowers the 
ordinal of the corresponding proof FY. Let us denote by 4, the smallest ordi- 
nal larger than the ordinal of any <,, for which ZBi, + W(<,,) holds. Next, 
a natural number, called order, is associated with every sequent S in a proof 
F which is premise of one of the following inferences: (1) cut, (2) induc- 
tion, (3) TRI, TRI, or TRIG. In cases (1), (2) we take the number of logical 
symbols in the cut or induction formula, in case (3) we take the number of 
logical symbols in (x)(x<,y > U(x)). Having defined the order wecanassociate 
a height A(S) with every sequent Sin Yin the same way as in [1] and [7]. Now 
we associate an ordinal o(S) with every sequent S in F and distinguish cases 
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according to whether S is (1) an axiom or else the conclusion of (2) a con- 
version or a structural rule other than cut, (3) of a logical rule, an induction 
or a cut, (4) of a TRI-, TRI,- or a TRIQ-inference. In the cases (1)}-(3) we 
proceed asin [1], [7]. If S is the conclusion of an inference under (4) with 
premise S’ and if o(S’) = « has already been determined then we put o(S) = 
= «,((a4 w°)w*) where d and & are determined as follows: 

(1). d = A(S’)—H(S), 

(2) mGise/,, iftheanferencesis a TRI, 

(3). if the inference is a TRI, then ¢ is the ordinal of <; where e is the 

Gédel number of a proof of > W(<;,), 
(4). if the inference is a TRI’, then € is the ordinal of {<x, y>|x <;ya 
x, y <;m} where e is as before. 
The ordinal of F is the ordinal of its end-sequent. The verification that 
an essential reduction step lowers the ordinal of a normal proof is now only 
a matter of routine, which leads to the same calculations as in the case of 
ZT(<). As a corollary of this fact we reobtain now theorem 2 of section 1 
for the present relation V*. From theorems 1,2 we obtain theorem 4 for 
ZBi, and hence also for ZBi, but with the same additional assumption as 
in section 3. 

We sketch the proof of theorem 4. Let F be a normal proof of > (4x) 2 (x). 
In view of theorem 2 we can transform F by means of finitely many reduc- 
tion steps into a normal proof # which does not admit any reduction step 
at all. Then there is an inference in F’ which satisfies the requirements of 
theorem 1. It is now easy to see that this inference is the right-most one 
amongst all inferences having their conclusions in the final part and it must 
have the particular form 


Pr > X(q) 


I — (Ax) (x) 


for some constant term q. By omitting this inference we obtain the proof 
required by theorem 4. 

In order to state that form of theorem 3 which holds in our case we de- 
fine by induction the notion- of a positive formula: 

(1). a prime formula is positive, 

(2). if &, B are positive then UAB, WvB, (Ax), (Wx), (oc), (Wa) 


are all positive. 
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THEOREM 3*, If F is a proof in ZBi, of — YX with YX positive then there is an 
infinite constructive proof of > A. 

Proor. We will give a brief sketch of the proof. We proceed by induction 
with respect to the number n of logical symbols in 2 and for fixed n by trans- 
finite induction with respect to V*. In order to fix the ideas we consider an 
Y of the form $v. We assume for simplicity that 2{ contains no variables 
and exactly one constant a,,; we may therefore write WU(«,,) in place of I. 
Without restriction we may assume that Y does not admit any preliminary 
reduction step. 

Case 1. F is normal and admits an essential reduction step which trans- 
forms FY into Y’. After a possible application of some preliminary reduction 
steps to Y’ we obtain a A” which is normal, has the same endsequent as F 
and satisfies Y’V*F. The statement then follows by transfinite induction. 

Case 2. F is normal and does not admit any reduction step. Using 
theorem 4 we find a proof F’ of > 8 or > ©. The statement then follows 
from the induction hypothesis. 

Case 3. F is not normal. This means that not every term in the final part 
of F is saturated. Call a finite sequence v good if (1) v is an extension of w, 
(2) replacing a,, by «, in F gives a normal proof F,, (3) if v is a proper ex- 
tension of v’ then v’ does not satisfy (1) and (2). For each good v case (1) 
or (2) applies to F and so we obtain cut free proofs A, of > W(a,). With the 
aid of the w-rule we can piece the Y, together so as to obtain a cut free 
proof of > %(a,). 

Theorem 3* remains true if all quantifier-free formulas are admitted as 
positive. In the system ZBi only numerical parameters are allowed in <; 
however only routine additional work is needed in order to carry through 
the arguments of section 4 for a system in whicn < may depend on function 
parameters. 


5. We consider briefly a generalization of the above to stronger theories. 
To this end let < bea p.r. relation such that v < w iff v, w are finite sequen- 
ces and v is a proper extension of w; denote for simplicity by < also a fixed 
prime formula t(x, y) = 0 representing this relation. Denote by x < y the 
formula x < yAD(x)AD(y) where D is arbitrary. By W°(<) we denote 
the formula (€)—(x)(€(x+1) < &(x)). We note that the formulas p < q, 
q<t>p~<t,p<q— Dp), p <q —> D(q) have cut free proofs con- 
taining only propositional inferences. The extension ZBC of ZT* contains 
the rules and axioms of ZT* and for every formula D the following rule: 
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Dy), (*) zy A, F > 4, Aly) 
Tig = ee 
W*(<), D(q), F > 4, U(q) 


with q free for y in 2(y), y not free in the conclusion and < related to D 
as described above. 

From ZBC we can pass to its intuitionistic version ZBCi and to subsy- 
stems ZBCi, in a similar way to before. Without proof we note that (1) 
ZBC is as strong as classical analysis, (2) ZBC can be interpreted by means 
of the mapping 2 => 2° (see [5], p. 493) in ZBCi. Let ZBK be that subsy- 
stem of ZBC which is obtained by putting the following restrictions on the 
formulas D: 

(1). ® contains no function variables and no <,, 

(2). the free variables of D are x, z. 

We write x <, y for x < yA D(x, t) A Dy, £). The intuitionistic version of 
ZBK exists again, namely ZBKi and if we admit only proofs of degree n 
we get the subsystems ZBKi,. The passage from ZBKi to a conservative 
extension ZBKi’ is now performed in the same way as in section 4. The only 
difference is that the rules TRI? are now replaced by the following slightly 
different ones: 


y <um, (x)(x <,y > U(x), F > Wy) 
TRI? eS Fe 


e,ey é, ey 


q <,m, r 2 (4) 


where e, t, q have the same meaning as for TRI? while e, is the Gédel-num- 
ber of a proof in ZBKi of + D(m, t). 

For ZBKi, (defined as in section 4) we can introduce the following 
kinds of reduction steps: (1) preliminary steps, (2) logical reduction steps, 
(3) induction reductions, (4) TRI-, TRI,- and TRI%,,-reduction steps. 
Reduction steps of kinds (1)-(3) and TRI-reduction steps are defined in 
exactly the same way as in section 4. Reduction steps TRI, and TRIZ,, are 
almost the same as their counterparts TI and TI, in section 2 with the follo- 
wing exceptions: 

(1). the cases (1) in the definitions of the TI- and TI,-reduction steps in 
section 2 are omitted now, 

(2). the axiom y < t > d(y) which appears leftmost at the top in the first 
of the two figures in section 2 is now replaced by a propositional cut-free 
proof of y <, q > D(y, t) in case of a TRI,-reduction, 
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(3). the axiom y < p, p < t > y < t which appears leftmost at the top 
in the second figure of section 2 is now replaced by a propositional cut free 
proof of y <,q,q <;m — y <; m in case of a TRI? ,,-reduction. 

With these reduction steps we can apply the arguments outlined in section 4 
with almost no change to ZBKi,. As a result we reobtain theorems 1, 2, 3* 
and 4 for ZBKi;. We have considered ZBKi’ instead of ZBCi’ only in order 
to illustrate how to pass from ZBi’ with number parameters to ZBKi’. 
The theory ZBCi’ however bears to ZBi’ with function parameters about the 
same relation as ZBKi’ to ZBi’ with number parameters. It is therefore not 
difficult to transform the proofs of theorems 1, 2, 3*, 4 for ZBi’ with func- 
tion parameters into proofs of theorems 1, 2, 3* and 4 for ZBCi’. Lack of 
space prevents us from considering details. They will be presented elsewhere. 


6. We conclude by mentioning some further results without proof. 


THEOREM 5. (a). For fixed n we can prove in ZBi (with number or function 
parameters) the following three statements: 
(1). if ZBi, + > YX with U positive then there is an intuitionistic construc- 
tive infinite cut free proof of > XY, 
(2). if ZBi, + + Xv B where Xv contains no variables and no special 
function constants then ZBi, + — % or ZBi, + > 8, 
(3). if ZBi, > (Ax) (x) where (Ax) (x) contains no free variables and no 
special function constants then there is a term t such that ZBi,+ > U(t). 
(b). For fixed n we can prove in ZBC the three statements in (a) but with 
ZBCi, in place of ZBi,. 


A proof of theorem 5(a) amounts to reproducing for fixed n in ZBi the 
classical arguments of sections 2 and 4; and a similar remark applies to theo- 
rem 5(b). If we add to ZBCi the rule 


(63) — R(x), ro 
— (Ax) R(x), F > UW 


where ‘t is p.r. and may contain parameters of any kind then we obtain 
a theory ZBCi* for which the exact counterparts of theorem 5(a) holds. 
In order to state the last results, let ZbCi be the theory which differs from 
ZBCi in the following respects: (1) there is no induction rule in ZbCi, (2) 
the special function constants and their axioms are omitted. Then we have 
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the following two theorems whose proofs are very similar to those in 
section 4. 


THEOREM 6. If F is a proof in ZbCi of > UX with YW positive then one can find 
effectively a finite cut free proof F’ in ZbCi of > U. If F is a proof in ZbCi 
of — (Ax)U(x) where UX may contain any kind of variables then one can find 
effectively a term t and a proof F’ in ZbCi of > A(t). 


In this paper we have only presented the smaller part of the conclusions 
which can be drawn from our basic lemma 1. Detailed proofs together with 
other results will be presented elsewhere. 
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SECTION E 


PROOF-THEORETIC ORDINALS 


HEREDITARILY REPLETE FUNCTIONALS OVER THE ORDINALS 


SOLOMON FEFERMAN* 


Introduction. Orderings used in proof theory may be thought of as orderings 
of terms built up from constants and function symbols corresponding to 
functions on the ordinals Q, e.g. 0, +, exp, for the ordering of type e,.' 
The ordering < of the terms t is induced by the (natural) order of their 
values Val (t) in Q: 


1 < ts => Val (i) < Val (a) 


The function symbols faithfully represent the given system of functions 
F on < if for each t the order type |t|< of t in < is the same as Val (t). 
Equivalently, this holds if the system F is complete, i.e. it fills up an initial 
segment of @. Three things are needed to make the ordering useful for 
proof theory: it should be (i) faithful, (ii) recursive,’ and (iii) provably 
well-founded by elementary methods. 

This paper continues the general study of [3] of properties of systems F 
which ensure that the associated orderings <, satisfy (i) and (ii), and the 
study of operations on systems which preserve these properties. (iii) is not 
taken up here. The properties (i) and (ii) are not in general preserved 
under the well-known process of adjoining the critical junction of F (to F). 
However, it was shown in [3] that stronger properties of F, called repleteness 
and effective relative categoricity, are preserved under this process, as 


* Research supported by grant DA-ARO-D-31-124-G655. 

+ In proof theory one uses other orderings too, suggested by reductions in proof figures 
(Takeuti’s ordinal diagrams [12]). These orderings are, to begin with, partial orderings 
of proof figures induced by a reducibility relation, which are then embedded, more or less 
arbitrarily, in a total ordering. 

t The need for this is evident where proof theoretical reductions to quantifier-free systems 
are to be given (as is commonly possible). However, this is also required for reductions 
to constructive systems when the equivalence of the two notions of well-foundedness, 
WF and TT, is to be used. Kreise] [7] Technical Note 1V has pointed out how Brouwer’s 
argument for this equivalence depends on the assumption of decidability of the ordering. 
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well as under its transfinite iteration. These notions and results are reviewed 
briefly in § 1 below. 

By means of a pairing function in Qx Q— Q, systems of functions F 
can be coded by functions F; Q > Q, indicated here by Fe Q'. Then the 
process of adjunction of the critical function is given by a functional 
&: Q' — Q', or @€Q?. The process of transfinitely iterating such a ® 
can be regarded as given by a functional from Q? to Q*. Let Q"*! consist 
of functionals from Q” to 2". When formulated in this type structure, 
the notion of repleteness for F in Q' is readily generalized to a notion of 
hereditary repleteness for functionals of every finite type n. The main 
result obtained here (§ 2, theorem 3) is that the iteration functionals of type 
n (n > 3), when defined properly, are all hereditarily replete. 

A suitable generalization of the notion of effective relative categoricity 
to functionals of higher type has not yet been developed. However, it is 
possible to show that orderings induced by iteration in all types of the 
critical process have isomorphic recursive ordinals (§ 2, theorem 5). This 
is accomplished here by the use of effective operations over the recursive 
ordinals. 

The remainder of the paper consists of three brief sections. Some open 
problems suggested by the preceding notions and results are formulated 
in general terms in § 3. In § 4 we consider the relation between the functions 
induced by iteration of the critical process and the functions of Bachmann’s 
hierarchy [1], and in §5 the relation to other theories of functionals over 
the ordinals. These sections contain some further problems and conjectures. 


1. Systems of type level 1. Let @ be the ist uncountable ordinal; 
a, B, y, € 4, € range over Q. Ordinals are taken so that a = {€|/€ < a}. 
F:Q—-— Q is normal if it is continuous and strictly increasing; A ¢ Q is 
normal if it is closed and unbounded. Let @(F) = range of F. 


Basic facts 


(i). A is normal < there exists (unique) normal F with 4 = &(F). 
(ii). F normal = {€|F(€) = €} is normal; in this case, F’ denotes the 
normal function enumerating this set. 


Special facts 


(iii). If (Ag>g<q is a sequence of normal sets with A, 2 A, for § < 
and if « < Q then (),., A; is normal. 
(iv). If (F>¢<o is a sequence of normal functions with 0 < F,(0) and 
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R(Fe41) & A(F:) for each € and AF.) = ()\,<-A(F,) for limit €, then 
A€F;(0) is normal. 

The facts (i}-(iii) lead to the definition of a certain transfinite iteration 
of the critical process applied to any initial normal Fy with Fo(0) > 0: 
let F,4, = F, for « < Q, and let F, enumerate ()\,.,A(F:) for « a limit 
number. The sequence <F;z>z<g is called the first hierarchy beginning with 
Fy; further such hierarchies of normal functions are incorporated in the 
Bachmann hierarchy discussed in §4 below. By (iv), A€F,(0) is again a 
normal function. As an example, consider the trivial normal function 
Fo = A€(1+€). Then F,(E) = o*+€ and F,(0) = o*. The first hierarchy 
beginning with 1£(@*) is a sequence of functions x™, in the notation of 
[Sie tas (cave, (©) — Cth fixed point of x”, etc. The normal 
function A€x(0) has as its first fixed point the least impredicative ordinal 
IP aye 

The theory of [3] §§ 1, 2, deals with sequences F = <F,,..., F,,> of any 
functions of one or more arguments F;:Qx...xQ— Q. Let Cl,(A) be 
the closure under F of the set Av {0}. The set of F-inaccessibles 
Ing = {a|Cl-(«) = «} is normal; let F’ enumerate Ing. The members of 
Cl-(0) are represented by closed terms in a formal symbolism Tm(/) 
containing symbols 0 and /f,,...,f,,- Put t; <p tz if Val,(t,) < Valp(t). 
The basic properties wanted of a system F are 

(a). F is complete, i.e. Cl-(0) € Q, 

(b). <, is effective (recursive). 

Example. F = (i&, n(@*+1)> has both these properties. For this case 
one has Cl-(0) = é) and in general (F)'(€) = e. Adjunction of G to F 
is denoted by F * <G). For the specific example, F * <F’> = (AE, n(w* +n), 
AE(E<)>. 

As mentioned in the introduction, the properties (a), (b) are not in 
general preserved in the passage from F to F * <F’). This leads to the defini- 
tion of stronger properties: F is said to be replete if for each a, Clp(a) € Q. 
A suitable strengthening of (b) is more recondite: F is effectively relatively 
categorical (e.r.c.) if, roughly speaking, the order relation between any 
two terms t,(x,,...,%,), 12(%,,...,%,) can be effectively determined 
from the ordering among x,,...,xX,, provided that these all represent 
F-inaccessibles. 

The following illustrate the general theorems which can be obtained (I.c.): 

I. If F is replete (e.r.c.) then F * (F') is replete (e.r.c.). 
II. If F is replete (e.r.c.) and G, is the ath function in the first hierarchy of 
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normal functions starting with Go = F’ then F * <i€, nG(n)> is replete 
(erc.): 

Consider <AE(1+€)>, which is trivially replete and e.r.c. By II so is 
KAE(1+€), AE, n(@*+n)>, and then by I so is <Ag(1 +), AE, n(w* +1), Agee, 
and finally by II the same holds for (AE(1+€), 2E, n(w* +1), 2€, nx (n)>. 

Both I, II have the form: if F is replete (e.r.c.) then so is (F), where ® 
is a functional from systems F to extended systems F * <G). The functional 
in II can be considered to be a transfinite iteration of the functional in I. 
To generalize these results, one would want to deal with the iteration of 
arbitrary functionals ® which preserve repleteness (e.r.c.). This is done in 
the following section in a way which doesn’t depend on the special facts 
about normal functions given above. This gives generalizations at least 
for functionals which preserve repleteness. 


2. Hereditarily replete functionals. The following part of the finite type 
structure over Q is used: Q° = Q and @'*! consists of all F: Q" > Q". 
For simplicity, m-tuples of ordinals will now be (naturally) coded by 
ordinals, and sequences of functions will be coded by single functions.* 

Take <&, 47> = 2°(2n+1); this is bi-unique from 2x Q to Q— {0}. For 
€> 0, let € = <(€)o, (€), >; take (O)Gn— (0) 0 Giveniag Oyama ny, 
sequence <F,>z<,(F,¢ 9"), let Azc,Fs = G, where G(E) = Fig, ((€),) if 
(E)o <a, and G(é) =0 otherwise (enumeration). Let (Fo,..., Fn) = 
= Azem+1F:. Now take Clp(A) to be the smallest set which includes 
Av {0} and which contains <&,4> and F(€) whenever it contains 
é,. As before, In, = {a|Cl-(«) = a} and F’ enumerates Ing. Now take 
Cr(F) = (F, F’). Write Me Rp mifetor cach o(< 2), Cle@ )icemerut 
FX, Gif Clp(«) S Cl¢(«) for each «. If a sequence of functions F is coded 
as a single function F for all « and Cl,(«) is closed under + and exp, then 
Cly(~) = Cl,(«), and F is replete iff Fe Rp’. The following is a reformula- 
tion of theorem 1 of § 1, with additional observations. 


THEOREM 1. Suppose F € Rp’. Then 
(i). Cr(F)¢€ Rp’, 
(ii). F<, Cr(F), 
(ili). GE Rp’ & FX, G> Cr(F) SN, Cr(G). 
Proor. The proof of (i) is like that of § 1, I; the proofs of (ii) and (iii) are 
easy. 


* An alternative is to start with 2° = set of all finite sequences of elements of 92. 
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The functional Cr € Q?. We shall analyze § 1, If as a result about iteration 
of this functional It(Cr), where now It € Q°. This is obtained by extending 
the notion Rp, and the operation A to higher types. 

For any sequence (F,><,, & < 0, of members of Q"*", let A:<.F: = 
= GeQ"*! where for all He Q", G(H) = A;<,(Fe(H)). Again, take 
(fee ee ee Oimcache Ge Olan > 2, let (FoG)EeQ 
with (Fo G)(H) = F(G(#)) for each He Q"™*. 

For each n > 2, an iteration functional It"! € Q"*! is defined as follows: 
for each Fe Q", It"*+(F) = F where F = F and for each « > 0, 
Own =r or), 

Suppose Rp” (¢ 2") and SJ, have been defined (m > 1). A functional 
FeQ'*' is said to be hereditarily replete and we write Fe Rp"*! if 
F; Rp" > Rp". Example. Cr e Rp” (theorem 1(i)). 

ork. 2, 6 Rp!) define 


Fo 4,4, F, = for all Ge Rp", Fo(G)S, F,(G).* 


The subscript ‘n’ is dropped from S,, 1n the following. 
The following gives the basic closure conditions on the Rp”. 


THEOREM 2 

(i). Suppose n > 2 and F, Ge Rp"; then Fo Ge Rp". 

(it). Suppose n> 1,0<a<Q and FzeRp" for each € <a; then 
(Azcg fs) ‘Ss Rp". 

Proor. (i). This is trivial by definition. 

(ii). This is proved by induction on n. For n = 1, let F = (Age, Fe). 
First show for any B 


le Cl (icons ey © Clai8) oa]. 


When each F, € Rp’, the right hand side is a union of ordinals, so Cl;(f) 
is an ordinal. The induction step from n to n+1 is obvious by definition 
of Rp"*! and the commutativity of A with application. 


As a corollary, Fy,..:, F,¢ Rp" > (fo,..., &,,)¢ Rp". The proof of the 
following is now straightforward. 


* The definition of Rp"*? given in the talk imposed some additional conditions in terms 
of <j. These are satisfied by the particular functionals we are interested in, but can be 
dispensed with in general. 


294 S, FEFERMAN E XIx 


THEOREM 3. Suppose n > 2. Then 
(ole emp ae 
(ii). Fe Rp" = Fa it" *(F). 
ii). FGeRp& FQAG=It"*(F)alt"*'(G). 


As a corollary (new version of § 1, II): for any Fe Rp’, (It*(Cr?))(F) € Rp! 
and F< (It?(Cr*))(F). 

Given K & |Jo<," and A & Q, define Clx(A) to be the smallest class 
C & (J, 02" satisfying the following conditions: 

OP OO Xe, 

(6) Se MOE RCM NC, 

(fii). n=>1& Fy, FpEeCNQ" => (Fy, Fie, 

(iv). n>1& FECAD&GECHAQ"' = KG)EC, 

(v). n22&Fy, FpeCnQ=> FoF, eC. 

If K < | )o<,Rp" and K is countable, then K is replete in the sense that for 
each a < Q, Ch(a)n QEQ. 

Suppose given an enumeration of K = {H, H,,...,H,,...} and an 
enumeration of a = {a,...,a,,...}. Corresponding to these enumerations 
is a symbolism with terms of finite type. It contains for each m constant 
symbols of the same type as H,, and symbols 0 and a,, of type 0. If to, t, 
are terms of type 0 then <i), t,> is a term of type 0. If ty, t; are of type 
n> 1 then (to, t,) is of type n and for n > 2 so is (tf) o f,). If ty is of type 
n> 1 and {, is of type n—1, then t(t,) is of type n—1. Each term t of 
type v has a value Val (t) in Clx(a) A Q”. Define 


ty Xt, > ty, ty are of type 0 & Val (tp) < Val (t,). 


We call this the ordering between terms induced by (the given enumeration 
of) K and a. When K is replete, this ordering (after identifying ty, t; such 
that tp < t, and t, < ty) is isomorphic to the ordering of the ordinals 
< Cix(«) Nn Q. 

Let Ko = {AG(IEEC), Cr, It?) eee 


THEOREM 4. Ky © PRg, where PRg is the class of primitive recursive func- 
tionals over Q. 


PRg is the obvious extension to Q of Gédel’s primitive recursive func- 
tionals over @; cf. [4] § 1. It is shown in [4] §3 that PRg qn Q is a recursive 
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ordinal. The argument uses effective operations in metarecursion theory. 
The same argument establishes 


THEOREM 5. For each « < @, (the least non-recursive ordinal), Clx,(«) 9 
OO 

Proor. Actually a simpler proof can be given in this case, using ordinary 
partial recursive functions over O. The subscript ‘p’ in the following is used 
to distinguish the notions here from those in [4]. Let 


best 
feE,*’ <(Vg)Lg e E, > {f}(g) € El, 


Nr, (a, 4,0) -aeO & [al = « 
and 


Nr, (f, F, +1) 
<> fe Ett) & Fe Q"*’ & (VG, g){Nr,(g, G, n) > Nr,({f}(g), F(G), n)}. 


We say that fis a number of F if Nr,(f, F, 2) when Fe 2". Now it can be 
shown that each Fe Ky has a number v(F). Moreover whenever F,, F; 
of the same type have numbers, so do Fy o F, and (Fo, F,). It follows that 
for each ae€ O, each Fe Cl,,(|a|) has a number v(F); moreover the set of 
these v(F) is recursively enumerable. Hence {v(f)|B € Clx,(|al) a Q} is 
recursively enumerable and is a subset of O, for some y < a. 

Note that this argument does not establish the recursiveness of the 
ordering of terms induced by K, and a, but only of an ordering isomorphic 
to it. 


3. Open problems. The following problems are suggested by the general 
direction of this work. I do not have precise formulations, but the intent 
is clear enough to guide their study and to test the adequacy of proposed 
solutions. 

The following is the main problem. 

(i). What is the general notion of (transfinite) iteration of the critical 
process? It would not be surprising if the type structure had to be suppressed 
or abandoned in a solution of this. Moreover, it should include a solution 
to the following problem; this is stated here because it seems more tractable 
at the moment. 

(ii). What is a suitable generalization of iteration to transfinite types 
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(in the present context)? Part of the problem here is to find the right trans- 
finite type structure; this should automatically provide for a generalization 
of the notion of repleteness. In a sense, the consideration of countable 
K & \Jo<,Rp" is an extension to type a. Then we can consider the func- 
tional © of type w+1, (K)=Kvu {K’}, where K’ enumerates 
{EICl(€) 0 Q = €}, etc. The following economy can be achieved here. 
Given countable K Galo, Rp’, set Cl. (O)inu@)— (hy. eee 
Define K = A,<,F,- Then Ke Rp’ and for any «, C(x) n Q = Clz(a), 
independent of the enumeration Fy,...,F,,..-. Any such K is said to 
represent K. Then Cr(K ) represents @(K). This gives us a new basic func- 
tional Cr, ¢ Rp’, namely Cr,(F) = Cr(Ky v {F}). From this is obtained 
in turn Cr, ¢Rp* with Cr,(F) = Cr(K, vu {F}) where K, = {4E(1+6), 
CreGr alt, tse .}. Obviously this process can be iterated, but it is not 
yet clear how to organize this in a coherent way. 

Returning to the notation of §1, when an F is e.r.c., we get much more 
information about it than that it is effective (e.g. theorem 2.10 of {3]). 
Thus, although one can obtain various recursiveness results such as 
theorem 5, it would be hoped to solve the following problem: 

(iii). What is a suitable generalization of the notion of effective relative 
categoricity to functionals? If such can be found, it should be possible to 
show, for example, that the ordering of terms induced by Kg is recursive 
(cf. the remark at the end of § 2). 


4.1. Relations to (Bachmann) hierarchies of normal fuctions. As described 
in § 1, associated with any normal Fp satisfying Fy)(0) > 0 is a hierarchy of 
normal functions <F,>,<g Which are successively critical for the preceding 
functions. Then Gp = AEF,(0) is normal with Gp(0) > 0, so the hierarchy 
associated with Gy gives a second hierarchy for Fy, etc. The Bachmann 
idea [1] is to keep track of the iteration of this process by going into an 
initial segment of the 3rd number class: let Fo = A€F,.(0), then Fo, is the 
ath function in the first hierarchy starting with Fp, Fo.2 = A€Fo+2(0), 
etc. This provides normal functions F,: Q > Q for & ranging over any initial 
segment of the ordinals < 2, for which one has associated fundamental 
sequences of order type < Q with each limit number. (E.g. Fo. ,, enumerates 
Oz<oA#(Fo-2), while Fo: = AEFy..(0)). This is easily done for 
& < €94, = Ist fixed point of Q” = v for v > Q. (It is extended further by 
building hierarchies of normal functions over Q,; we return to this point 
below.) Using notation which has become fairly standard in this connection, 
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let @o = AE(1+€)and fora < é9,, let g, be the ath function in the extended 
hierarchy starting with g,.*t 

Note that fundamental sequences have to be considered in the definition 
of the g, in order to keep all functions normal and to apply the special 
facts (iii), (iv) of §1 at limit numbers. This problem is avoided in the 
definition of the iteration processes It" considered above by allowing non- 
normal functions and using the enumeration operator A to encode every- 
thing obtained at earlier stages. This suggests that it should be possible 
to cover the ~, by functions in Clx,(0) > @', or even extract them in some 
sense. 

The following relations are conjectured. Superscripts n are dropped from 
the It” where there is no ambiguity. 

For any a, 


Pa+aS (It(Cr))(9,). 
Forme, — liCr) 


Pat a-2 J Ea(@,), eee Pata-é << Hak(@:) when is & Gy 
and 


Patror SI (It(F ,))(9,). 
Det Fy —alt(2,) = Ir(it(Cr)). 


Pat m2.2S or aa F?(@,) nOrc < <e, 
and 
Pr+o3 A <I F3(9,), 


where F; = It(F,) = It(It(It(Cr))). In general 


Prva: J (I(CH(9,) for & < Q 
and 


Parga Si ((It(It))(Cr))(p,). 


In particular gga S It*It*? Cr gy (application associated to the left).If this 
holds one would guess that 


* Bachmann starts with go(&) = w$ which is here yp (&). For « = Q? we have Q+a = « 
and the function , is the same with either starting point. A minor difference is that [1] 
begins enumerations with 1 rather than 0. 

t An interesting comparison of the ordinals obtained by the Veblen-Schiitte functions 
[11] and by these hierarchies has been made by Levitz [8]. He shows that Veblen’s first 
E-number is just Pon(0). 
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Can at) i lees Sto Cras 
Qe 


&o+1 is the limit of Q, 2°, Q°,.... Thus every ordinal < @,,, (0) would 
be in Cl, (0) 9 2. 

By theorem 4, Clx,(0) 1 2 © PRg 4 Q. Howard (unpublished) has 
shown that PRp Nn Q < ¢9,,,,(0). This leads me to the following 

Conjecture. Clx,(0) 0 2 = 9,,, ,(0). 
Let Q, be the vth initial ordinal, with Q = Q,. The hierarchy , for 
a < &94, essentially uses the hierarchy of normal functions Wz = An(Q° +7) 
from Q, to Q,. It can be extended further by building up hierarchies of 
normal functions over 2,. This in turn requires consideration of funda- 
mental sequences in 923, etc. Such use of the 2, has been systematized to 
some extent in the (independent) work of Pfeiffer and Isles for finite v and 
more generally by Isles [6] for certain transfinite v. Assuming the conjecture 
above is correct, one is immediately led to the following question: What 
corresponds in the present approach to the use of the initial ordinals Q, for 
extending the hierarchies of normal functions? Some obvious candidates are 
supplied by an extension of the theory to transfinite types as anticipated in 


8 2p 


4.2. Remarks on the problems of associating fundamental sequences and 
well-orderings. An association of fundamental sequences in an initial segment 
of the 2nd number class is a relation (a, @) satisfying the following con- 
ditions: 
(a)(i). (a, g) => « is a limit number < Q and og = Jn < wg(n) is an 
increasing sequence of ordinals with « = sup,<,(n), 
(ii). 3@ @) &IaY) > 9 =Y, 
and 
(iii). G(«, gp) & B <a & Ba limit number > W)F(f, W). 
An association of well-orderings in such an initial segment is a relation 
Wa, x) satisfying 
(b)(i). Wa, x) => « < Q and x is a well-ordering relation of order type 
a with field < a, 
(ii). Wie, x) & Wa, y) > x = »y, 
and 


(iii). Wa, x) & B < « => (Jy) W/Z, y). 
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The domain a of QW is {a|(Ax)W(a, x)}; the domain ag of % is defined 
to be {y|(da > y)(Ap)B(a, ~)}. The existence of associations §, Y& with 
Oe = Q, Uy = Q, is trivial assuming the axiom of choice. 

The general aim of Bachmann’s work is to produce associations % with 
very large domains which are explicit, in some suitable sense. To see the 
matter in perspective, let us go over the simplest historical and logical 
relations. 

The interest in fundamental sequences goes back to a result of Hardy 
(1903) (reproved in Church [2]): A well-ordering of type Q of a subset 
of the continuum can be defined from any (association) % with domain Q. 
Hardy simply showed how to define a 3 with domain Q from such an %. 
Compare Lévy [9] who showed (1963): it is consistent with ZF + AC+ GCH 
that no formula of the language &, (of set theory) defines a well-ordering 
of type Q of a subset of the continuum. So there is no %, definable in 
&. for which (a)(i)-(iii) and a, = can be proved in this system, let alone 
ZF, 

The next logical question is whether for some countable « and possible 
7 and % with domain «, one can be essentially ‘simpler’ than the other. 
An answer to this question, in terms of axiomatic set theory, follows from 
examination of the Hardy-Church arguments [2]. Given a definition of %, 
we can read off a definition of 3 with the same domain; for the converse 
one uses a straightforward definition by transfinite induction. Put axioma- 
tically, the two steps can be formalized in a set theory without the power- 
set axiom, with the 2,-axiom of choice as its only non-trivial existential 
axiom. (This system is quite weak, of the same strength as 2j-AC in 
Friedman [5].) So, proof theoretically, the problems of finding § and W 
are, In an obvious sense, equivalent for systems containing this weak set 
theory. 

What can be said about a which are provably domains of % or ¥? 
Certainly, for Q¢ = the least constructibly uncountable « we have a W, 
with domain Q/, namely 


a < Q' & x is the first well-ordering of w in L of type a 


(‘first? in the usual well-ordering of L). Since Qf must be considered too 

‘large’ (at least for applications to proof theory), this implies that additional 

conditions should be imposed; for example (as suggested by Kreisel), add 
(b)iv). Wa, x) & WP, y) & B < «=> y is an initial segment of x. 


(It is less obvious what additional conditions on % would be natural.) 
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Question. For any %8 which can be proved (say in ZF) to satisfy (b)(i)-(iii) 
is there a %3’ which can be proved to have the same domain as YW and to 
satisfy (b)(i)-(iv)?* 

The general question of finding natural additional conditions on ¥8 seems 
to have some independent set-theoretic interest. 

To summarize: on the basis of the present evidence, there seem to be no 
logical advantages in associations % as compared with associations W. It 
might be considered possible that the results are formally simpler to state 
for 7% than for Y&. However, this possibility is slight if the conjectures above 
(concerning Bachmann’s hierarchies of normal functions and the functions 
obtained by higher type iterations of the critical process) are true. 


5. Relations to earlier work with functionals. Neumer has developed extensive 
classes of functionals over the ordinals, but without any general theoretical 
framework. The work has undergone a series of expansions and revisions 
in a number of papers; the most recent versions are to be found in the 
papers [10]. The ‘operators’ of [10] part I and ‘selectors’ of [10] part II, 
are both functionals of finite type, but the objects of type 1 are always 
normal functions; The operators /, 7’, / Wot theformerand’s, Ss 3S ae 
of the latter correspond, respectively to the functionals Cr, It?, It*,... 
defined here. In addition, Neumer considers certain symbolic operators 
which he calls ‘facients.’ The facient 14, is really a functional making use 
of the iteration of the process of forming, for any normal F € Q"', the function 
enumerating the inaccessibles of (JF, I'IF, I I'IF,...}. In terms of § 3, 
it corresponds to (It(Cr,))’. The facient 2, corresponds to (It(Cr3))’, ete. 
I expect that a systematic development of the ideas concerning transfinite 
types indicated in §4 would comprehend all the facients considered by 
Neumer.!* 


* It should be remarked that the domains of definable 98 provably satisfying (b) (i)-(iv) 
are the same as the types of provably definable well-orderings R in w, by R = U[x| 
(Ax) IB (a, x)]. 

t I was not acquainted with Neumer’s work at the time of my talk on §§ 1~4 at the con- 
ference. I wish to thank H. Levitz for bringing it to my attention. 

t More recently, and independently, iterations of functionals of type 2 were used to ana- 
lyze the ordinals of the Veblen—Schiitte system [11] by Levitz [8] and Page (dissertation, 
Oxford). Using the footnote of p. 000 this jibes with the bounds involving 9,4 9: 
conjectured in § 3. 
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FORMAL THEORIES FOR TRANSFINITE ITERATIONS OF 
GENERALIZED INDUCTIVE DEFINITIONS AND 
SOME SUBSYSTEMS OF ANALYSIS 


SOLOMON FEFERMAN* 


Summary. The first order systems ID, (for ordinals v given by primitive 
recursive well-orderings <), described fully in § 1, express roughly speaking 
that a familiar principle for defining specific sets of natural numbers can be 
iterated v times. Kleene’s definition of O is a typical instance of ID,. 
Such systems were first studied by Kreisel in [6], where the principle was 
referred to as that of generalized inductive definitions (g.i.d.). More precise 
information on background is given in § 2.1 below, after the various systems 
considered have been described in detail. ID <, is essentially (),<, ID,. 

The systems (J7}-CA) , in the language of classical analysis (containing 
variables for sets of natural numbers) express roughly, that there are 
hierarchies obtained by iterating the hyperjump operation any number 
less than v times. Such systems were first studied by Friedman [4]. 

Let (BI) (in the language of analysis) denote the familiar principle of bar 
induction, that is the equivalence of two senses of well-foundedness of 
primitive recursive orderings. 

The principal results of the present paper (§ 3) establish a strong form of 
“proof-theoretical equivalence’, called w, T-equivalence, between the following 
systems: 


(3.1.4) For v = w’*?', vy <8, (11{ -—CA)<,+(BI) is w,T-equivalent to ID,. 


(3.1.7) For v = w’, with y a limit number and v < &, (II{|-CA)~.,+(BD), 
(I1;-CA)., and ID<, are all w,T-equivalent. 


* Research supported by grants DA-ARO(D)-31-124-G985 and NSF-GP-8764. A pre- 
liminary version of this paper was presented to this conference under the title “Note I: 
GID,, and the IT i-comprehension axiom’ of ‘Two notes on theories GID, given by » itera- 
tions of generalized inductive definitions.’ (The version printed here does not include all 
the material presented at the conference. The rest is contained in [2]. Eds.) 
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@,T-equivalence and a slightly weaker relation = are explained in § 1.5; 
when S = S’, (there is a finitist proof that) S, S’ have the same arithmetic 
theorems and, in a suitable sense, the same JJ}-theorems. 

The following corollaries establish relations with more familiai sub- 
systems of classical analysis: 


(2.2.1) For v = @, (IIj-CA)+(BI) is w,T-equivalent to ID,. 


(3.2.2) For v = w®, (A,-CR) is w,T-equivalent to ID.,. 
(3.1.9) Forv = 2,,(4;-CA) = 1D 


In fact, the argument for the first of these is the basis for the more general 
theorems above. In turn, it extends arguments in § 2.1 for related results 
concerning ID, and systems (IJj-CA)~ and (BI) without set parameters. 
The theorems of §3 were developed from these in collaboration with 
H. Friedman. 

The significance of these results for constructive foundations of analysis 
is considered in § 4 and for the work of Tait [12] and Takeuti [15] in § 5. 


§ 1. Preliminaries 


1.1. Languages used. All the languages considered are sublanguages of 
a’ 2nd order language’ 275(P5,P15-. .) were 2o, 1,40) ate “namerder 
(set) constants. The Ist order vocabulary has the (numerical) variables 
X,),Z,..., the constant 0, and a symbol for each primitive recursive 
function; the (n+1)st numeral o—— is denoted by nn. t,t,,... range over 
terms; the atomic formulas are (t; = t,) and (t¢P;). The full language 
has in addition the 2nd order (set) variables X, Y, Z,... and atomic 
formulas of the form (t¢ X). With any finite or infinite sequence <k;>j<m 
is associated the sublanguage Y,(P,,, Px,,---) having only the P,, as set 
constants; this is written Y, when m= 0. #,(P,,, Py,,---) (or simply 
L£, when m = 0) is the sublanguage with no set variables. 

Let # be any sublanguage of 275(P,. P14)... ll theories 1 ine 
will be assumed to contain the axioms and rules of classical two-sorted 
predicate calculus, with equality axioms in the Ist sort. When Y is con- 
tained in £,(Po, P,,...), this is just the usual Ist order logic. In this case 
the P; can be treated as predicate constants and we may write P;(t) instead 
of (t¢P;). In addition, for any & it will be assumed that T contains all 
axioms of elementary number theory Z in #, i.e. the usual axioms for 
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0, ‘, and the defining equations for the primitive recursive functions, 
together with all instances of induction: 


BO) A AxLBX) > BO) > Ax), 


where {5 is any formula of #. 

By the (set) parameters of a formula we mean its free (set) variables. An 
arithmetic formula is one without set quantifiers or set constants; it may 
contain set parameters. 2, 8 range over arithmetic formulas. An arithmetical 
formula in which all quantifiers are bounded is said to be elementary. Kt 
ranges over the elementary formulas; these include only formulas cor- 
responding to relations primitive recursive in their parameters. The usual 
notations of recursion theory are used both formally and informally. 
om aiswiitien for 2° 37) and’ xc X, for. xy) eX. 

For any formulas %(X) and G(x) (perhaps with other free variables), 
(G(x)), or simply %(G), denotes the result of substituting G(t) for each 
occurrence (t{e¢ X) in %. Functions are identified with sets X such that 
AxVly<x, y> eX; a, B range over functions. Thus a(t,) occurs in a 
formula only in contexts a(t,) = t,, ie. <t,,t.>ea. Given %(«) and 
G(x, y) where AxV!yG(x, y) has been proved, §(AxG(x, —)) denotes 
the result of substituting G(t,,t,) for each occurrence a(t,) = t, in %. 
The notations %(&(x)) and §(G(x, —)) are explained in terms of these. 

The well-foundedness of a relation Q(x, y) is expressed by the formula 
WF(Q): AaVx ~ Q(a(x+1), «(x)). The principle of transfinite induction 
w.r. to O in F is the schema 


THQ, 8): Ax{A LQ, x) > SO) > G&)} > Ax) 


wheie % is any formula of Y. When Q(x, y) is written (x < y) these are 
written WF(<) and TI(<, %). Let <,, be the natural ordering of order-type 
€) in w. For each v < &9, let <, be the initial segment of <,, with |<,| = v. 
All instances of TI(<,, %) in & are derivable from Z in & by [11] pp. 
201-211. 


1.2. 1D,. Let M1, %,,... be an enumeration of all arithmetic formulas 
U(X, x) of L, containing at most X, x free, and such that each occurrence 
(te X) in Wf is positive. For each UW = WU; write Py for P;. Let Y be a 
sublanguage of £,(Po, P;,...) containing Py. The axioms of ID,(2) in 
£ (in addition to those mentioned in 1.1) are 
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(1.2.1)(i). A x[2l(Pa. x) > Pu(x)], 


and 


(ii). A x[2U(%, x) > B(x)] > Ax[Pa(x) > B(x)], 


for each formula 8 of #. ID, is the union of ID, (2;) in #,(Po, P,,...). 
Unless otherwise indicated, any one ID,({) is always taken in &,( Py). 

Consider any 2 = ,;. A set X is called %-closed if \ x[QU(X, x) > xe X). 
The positivity condition on 2 ensures the monotonicity condition 


(1.2.2) X SY > Ax[W(X, x) > ACY, x)]. 


(Here and below, X © Y abbreviates Ax[xeX—>xeY] and Y=Y 
abbreviates A x[xeX<+xeY].) Monotonicity in turn ensures in set 
theory that there is a least 2{-closed set. In other words a model for ID, (2) 
(in any &) is obtained by defining 


(1.2.3) Pa(x)o A X{A y[U(X, y) > ye X] > xe X}, 


where X ranges over arbitrary sets. 

Thus (1.2.11) expresses the existence of an 2-closed set Py and (1.2.1ii) 
expresses that Py is least among the -closed sets defined by formulas $ 
of &. (i) is also often called the inductive definition of Py and (ii) the 
associated principle of proof by induction on Py. 

An important example of an ID,(2l) is provided by the inductive defini- 
tion of the accessible part of a relation <, taking 2(X,x) to be 
Ayly <~ x > ye X)]. Set theoretically, the least Y{-closed set is seen to be 
the same as {x: < | x is well-founded}. In this case (1.2.1)(ii) is just 
TI(<*, 8) where <* is the restriction of < to its accessible part. O can be 
defined to be the accessible part of a rec. enum. <. A closely related induc- 
tive definition of a set T of recursive trees (in 1.4) will be used extensively 
throughout the paper. 

For each % = %;, the monotonicity condition is logically derivable. 
Equivalently, 


(1.2.4) A y[P(y) > P’(y)] > A x[U(P, x) > WP’, x)] 


is derivable in Y,(P, P’). Kreisel has remarked ([7], p. 138) that conversely 
any 2 for which this holds is equivalent to one of the 2;. This extends also 
to the case that (1.2.4) is derivable from axioms of the form S(P) u S(P’) 
where S(P) consists of sentences 6(P) in #,(P). For then there are 8,(P) 
in SCP): = Ieee such tat 
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 AyLPO) + POM TI B(P) AMP, x) > [TT B(P) > MP", 


Hence by Lyndon’s interpolation theorem [10], we can find a formula 
M,(P’,x) such that P’ is positive at all occurrences in %, and such that 


A yLPO) > POM TBP) AMP, x) > WP», 
and 


= SiC) I B,(P’) > U(P’, x)]. 


Hence S(P)+ [Q(P, x) — U,(P’, x)]. 


1.3. ID.., ID, and ID.,. We now consider theories for iterations of induc- 
tive definitions relative to any given prim. rec. well-ordering <. In this 
section, let Mf), %,,... be an enumeration of all arithmetic 2(X, Y, x, y) 
in #, containing at most X, Y, x,y free, and such that each occurrence 
(t € X) in W is positive. For each YM = YW, write P™ for P;. Now write P}(x) 
for <x, y>e€P™. Let Y be a sublanguage of Y,(P), P,,...) containing 
P™. The axioms of ID_(U) in Y are 


(Coes CR ae Nee ylQucP ys 2((z), < rately x,y)> Peak 
(ii). A y{A x[2U(®, 2((z)1 < yA Po,((Z)o), x y) > B(x)] 
oe eB), 


and 
(iii). TI(<, 8), 


foreach Bin”. 1D. is the union of ID_(Y! ,) in &,(Po, P;,...). Unless 
otherwise noted, any one ID_.(2) is taken to be in #, (Pq). 

Given 2 = %,, a model for ID_(Y) in any # is set theoretically defined 
by the following transfinite recursion on < 


(1.3.2) PY(x) > AX{Au[U(X, 2((2)1 < yA Pe,((Z)o)s & y)> we X] 
> xe X}. 


Now the positivity of X in Y(X, Y, x, y) ensures that for each y, P¥ is the 
least set satisfying the closure condition / u[.. .]. 

A variant of the preceding definition of ID~ applies to any prim. rec. 
relation <, giving theories for iterations of inductive definitions along its 
accessible part <*. This is obtained by replacing < by <* in (1.3.1i, ii) 
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and by changing (1.3. lili) to the axioms of ID, for the accessible part of <. 

When | < v < é and < is <, we shall write ID,(), ID, for ID. (Qf) 
and ID., resp. As noted in § 1.1, we can omit (1.3.liii) in these cases. The 
results stated in this paper will concern various ID,; many of these can be 
generalized to arbitrary ID.. The restricted formulations are taken for 
simplicity and because they suffice for the known applications of interest. 

For2 < v < @, & = A, and Y containing P™, we define ID.,(M) in F 
to be the union of ID, (2) in for 1 < p < v.* This is contained in ID, (2) 
but may be weaker since we only have (1.3.1i, ii) for each section <, of <,. 
We take ID, to be the untontot all Oe in eee 


1.4. T,. The inductively defined sets T, to be discussed now play a central 
role in the reduction of certain subsystems of analysis to ID,. It also follows 
from these results that ID, can be reduced to ID,(2), where 2 determines 
the closure conditions for (T,4,>,<y. Ty, is the set T of Gédel-numbers 
for the well-founded recursive trees. The set O, = O and the higher con- 
structive number classes O, could serve the same purposes; the 7, have 
some technical advantages here. 

Throughout the following, s, 5; range over sequence numbers. s, ¢ s 
if 5s, represents a proper initial segment of s; < > is the number of the 
empty sequence. For any set Y, T” is defined to be the set of all e such that 
(i) {e}¥ is total, (ii) AaVyfe"(@(y)) #0, and (iii) As,s,[s,; = 5A 
{e}¥(s) = 0 + {e}¥(s,) = 0]. The branches of the tree determined by e 
are the (sequences represented by) sequence numbers s with {e}%(s) = 0. 
The tree may be empty. For e € T” take |e| to be the ordinal of the associated 
bec: 

The main result about T” is that it is complete for the predicates Ij in Y. 
This is by the standard unsecured-sequences argument: Given R(s) with 
set parameter Y, associate a total {up}" with 


{up}"(s) = 0 As,[s, os > ~ R(S;)1.- 


Then AaV yR(&(y)) <> ug eT’. The set theoretical assumptions in this 
argument are examined formally in § 2.1 below. 

Let S be a prim. rec. function such that for any Y, e, s, and s, 
{S(e, 5,)}¥(s) ~ {el¥(s, #5). We write e | 5, for S(e,s,) and e [ <n) for 
e [ 2"t1, When ee 7", e | s, is a number for the subtree below s,. The 


* One could consider more generally ID < (CW 1 <wey)s this would require assignment 
of distinct symbols P™x,# to distinct y. 
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following holds for any x, Y: 


(1.4.1) 7” is the least set X satisfying \ x[Q,(X, Y,x) > x€X], where 
y(X,Y,x) is \zV w({x}"(e) = w)a LA s({x}"(s) # 0)v A(x MudeX)] 


The proof that 7’ satisfies the closure condition is direct by definition. 
For X satisfying the condition, [e¢ T’ + e€ X] is proved by induction 
on |e]. 

Let Ty = @ and T,,, = T’ for n < a. This is extended to a definition 
of T, for v < &) by working with < = <,,. We can assume that < has 
field w, 0 is the least element in this ordering, and that the successor b @ 1 
of each element b is 2°. Let Lim (b) hold if 5 is a limit element, ice. 
b #0, b¥ (b)) @ 1. T, is defined by recursion on b as follows: 


(1.4.2)(i). T, = @ for b =0, 
(ijaiee=9 7: (an for b — (b), eleand 
(iii). x € T, (x), < DA(x)o€ Ty,, for Lim (5). 
We also write 7, for 7, when |b] = v in the ordinary <; in this sense, T, 


is the same as above. 
Pet l= v <<, |a|— vy, take w,(x._¥, x, y) to be the formula 


y <an {Ly = (y)o ® LAM,(X, Yoyo X> y) 
v [Lim (y) A(x); < yA(x)o € You, I}. 
X is positive in 2. By (1.4.1, 2), the model for ID,(2l,) given by (1.3.2) has 
PY(x) y <anxeT,. 


We shall write ID;, for ID, ,,(2,.,) and 7,(x) for P}'+*(x) in the theory. 
Also, ID;, <, is written for ID<¢1+,(U1+,). 


1.5. w, T-models. We shall consider relationships between certain subsystems 
of analysis and systems ID, and ID;,. The latter always contain ID,,. 
We can identify this with ID, (2,) where 2,(X, x) is U(X, 9, x) of (1.4.1). 
In this case we write ID; for ID;, and T(x) for Py,(x). The set theoretical 
definition (1.2.3) of a model of ID; is expressed by a formula of #2, 
namely 


(1.5.1) AX{A y[Ql7(X, y) > ye X] > xe X}; 


this formula is also denoted by T(x). 


310 S. FEFERMAN E XxX 


Suppose S, S’ are sets of axioms in 2, 2” resp., where 7, 2 Sare 
sublanguages of £,(P,,P,,...). A relative interpretation of S in S’ 
({16], pp. 20-21, 29) or uniform S’-model of S ([8], p. 364) assigns to each 
symbol of ¥ a definition in #’ and to each sort of variable in ¥ a range of 
variation given by a formula of #’, such that the translation 5’ of each % 
in Sis provable in S’. By an w-model of S in S' we mean a uniform S’-model 
of S which leaves the symbols and range of variables of Y, fixed. 

Let Sj; be a set°oi axioms in 4% = —7,(P,. P,..)and Sj a semen 
axioms in #,. An w, T-model of S, in S, is an w-model for which the 
translation (T(x))’ is provably equivalent to T(x) in S,. An w, T-model of 
S, in S, is an @-model for which the translation (T(x))’ of (1.5.1) is provably 
equivalent to T(x) in S,. In this case (7(x))’ has the form 


A z{M(z)a A y[lr(z, y) > ye’ z] > xe’ zh, 


where (y €’ z) is the formula defining (y €z) in Y and M(z) is the formula 
of & giving the range of variation of the 2nd order variables. 

A theory S is said to be w, T-reducible to a theory S’ if every finite sub- 
system of S has an w, T-model in S’. Sand S’ are said to be w, T-equivalent 
if each is w, T-reducible to the other. In every case in the following where 
it is asserted that a theory S has an w, T-model in a theory S’, or is @, T- 
reducible to S’, we have a finitist proof of this fact. It follows that S is 
proof theoretically reducible to S’ in the following strong sense: we have 
a finitist proof of the fact that every sentence provable in S, which is arith- 
metic in the formula T(x), is provable in S’. We write S < S’ when this is 
established; trivially, S < S’ implies that S is consistent relative to S’. 
We write S = S’if S < S’ and S’ < S. 


1.6. Some subsystems of analysis. Let FY be any collection of formulas 
@(x) of L, (parameters permitted). The #-comprehension axiom schema 
(F-CA) consists of (the universal closures of) all formulas 


(1.6.1) VXAx[xe Xo §(x)], 


where %(x) is in F and X is not free in (x). (§-CA)” is (--CA), where 
% consists of the formulas in %§} without set parameters. The %-axiom of 
choice schema (%-AC) and its restriction (-AC) are obtained similarly 
from (ACp,) of [8], p. 326, when § consists of formulas %(x, «). 

Let JI} = 29 be the collection of elementary formulas R(x). All the systems 
in £, considered here are assumed to contain (JTj-CA). Each system is 
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named by its corresponding additional axioms. (72-CA) = (£2-CA) is 
derivable (over (I7p-CA)) from the instance 
VXAx[xeX o A (Xx, YE Y)] 


of (IT?-CA). (ITj-CA)” has the additional axioms VXAx[xeXo 
Aa yR(x, &(y))], where ® contains no set parameters. (J7{-CA) can 
be taken to consist of all sentences 

ABV XAx[xe Xo AaV yR(x, ay), B(y))]- 


(11?-CA) is derivable in (IT}-CA). The systems (J7|-CA)<, mentioned in the 
introduction will be defined in § 3 below. 

The (classical version of the) system (BI) of bar induction is given by 
the additional axioms 


(1.6.2) WE() > TI(S, $). 


(BI)~ denotes the restriction to the case that {t contains no set parameters, 
but where no restriction is made on %. 


Lemma 1.6.3. [A XU(X) > U(%)] is provable in (11-CA)+(BI); it is 
provable in (I12,-CA)+(BI)” when A XX(X) contains no set parameters. 

Proor. The standard reduction of 2(X) to [j-normal form associates 
an R(X, 5s) such that 


(11,-CA) F [U(X) o ABV xRCX, B(x). (1) 
This is done in such a way that with each % is associated a © s.t. 
F AxV! yG(x, y) ATV xR(F, GE, —)) > AS). (2) 
For example, suppose %(X) has the form VxAyt(X, x,y). This is 
equivalent to ABV xR(X, x, B(x)); moreover, [V xt(X, x, Bo(x)) > 
VxAyit(X, x, y)] for the function Bo(x) = (uy) ~ R(X, x, y). In this 
case take ©(x, y) to be the formula 
[~ RB, x, YA AZ < pRB, x ZI VIA ZR, x z)Ay = 0]. 
(BI)E AX ABV xX(X, B(x)) > V xR, G(x, —)). (3) 


More generally, for any %(s) and © such that Ax V!yG(x, y) has been 
proved we have 


(BI) AaV x8(&(x)) > VxR(G(x, —)). (4) 
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This follows from [WF(®,) > TIQR,, G,)] where Ry (s, s‘) is 
[sea(s) A Sea(s’) a Az < Lh(s) ~ Rs =)A(s' <5), 
and ©,(s) is 
[Seq(s) A Az < La(s)@(z, (9)-1) > VGC, -))] 


§ 2. Relationships between ID,, ID,, and subsystems of (17;-CA)+(BI) 


2.1. Relationships between ID,, (11;—-CA)7 and (BI)~. The following are 
the known results on these systems; they are due to Kreisel (proved or 
sketched in [6], [7] p. 137, and [8] § 10): 


(2.1.1)(i). ID, = IDo. 
(ii). (BD < (Bl) < Wo <i). 
(iii). ID, has an w-model in (IT2-CA)+(IT}-CA)~. 


Actually, these are special cases of Kreisel’s results principally concerning 
intuitionistic systems, which will be considered in $4. In particular, the 
proof of (ii) depends essentially on the use of such systems. 

Though the results (i), (ii) are sharp for ID, , IDp (or equivalently ID;), 
(BI)~ and (BI), they leave open the exact relationship between ID, and 
IIj-CA~. This is established by the following theorem. More importantly, 
the model-theoretical formulation and proof are the basis for the results 
below relating various ID, with subsystems of analysis. 


THEOREM 2.1.2. 


(i). ID, has an w,T-model in (11-CA)+(BI)~, as well as in 
(112 ~CA)+(IIj-CA)~. 

(ii). (II;-CA)" + (BI)~ Aas an w,T-model in ID. 

Proor. (i). The definitions of the w-models of ID, are the same in both 
cases, but the proofs differ in the treatment of the principles of proof by 
induction. For each Py of ID,, translate Py(x) by 

Pa(x) = AX{A y[U(X, y) ~ ye X] + xe X}. (1) 


This determines a translation of #,(Pg,, Pa,,..-) in &, preserving Y,. 
The translation 


A x[U(Pa, x) > Pa(x)] (2) 
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of (1.2.11) is logically valid and so holds for both systems. For, given any 
two formulas %(x) and G(x), the proof of monotonicity applies equally 
well to show 


F A z[(z) > G(z)] > Ax[ Q(B, x) > WG, x)]. 
It follows that 
F A y[UCX, y) > ye XJ > Ax[U(Pa, x) > U(X, x], 
so that (2) is derivable. 
The translation of (1.2.1ii) is 


/\ x[U(B’, x) > B'(x)] + AxLPa(x) > B'(x)], (3) 
where 
Bia Ope as x) 
is the translation of 
S505) Senn oo ee 
First consider (3) in (IT}-CA)~. For each i, 
(IT,-CA)+(IT{-CA)” # VY Ax[x€ ¥ @ Py(x)] (4) 
by reduction of (1) to I7{ normal form. Then also 
(HI¢,-CA)+(1j-CA) + VXAx[xeX — B(Py,,...Pa.x)] (5) 


giving derivability of (3) in this case. 
To obtain (3) for the case of (BI)~, apply lemma 1.6.3, taking Py(x) 
for the formula / XU(X): 


(1T,-CA)+(BI)” F Pa(x) > {A y[X(B’, ») > BY] > B)}. © 


Let us now turn to the proof of part (ii). We formalize the proof of the 
basis result for Ij predicates: A aV x8(&(x)) << A ay V x¥i(&(x)), where 
M is the collection of sets recursive in 7. Thus M is trivially an @-model 
for (I1;-CA)-. But also if WF(%) is true in M then § must actually be 
well-founded; hence TI(R, 7) will hold in M for any formula %. Since 
is definable in ID; it need only be verified that the formal principles of 
ID; are sufficient to derive these facts. 

Let M(x) be the formula A y V z({x}"(y) ~ z) of #,(T) and (ye’ x} 
the formula ({x}"(y) ~ 0). The range of variation of the 2nd order variables 
of £, under the translation is given by relativization to Jt. This and the 
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formula (y e’ x) determine a translation of #, into #,(T) preserving the 
symbols and variables of #,. 

Given St(x), the 2nd order definition of being a function is translated as 
Ay Vi!2({x}"(Xy, z>) ~ 0). With each such x is associated x, with 
{x37 (y) = (uz)({x}"(Ky, z>) ~ 0), so M(x,). Conversely, every x, in M 
can be found in this way from its graph x in JM. Hence the translation of a 
formula AaVyR(a(y)) is equivalent to /A x{M(x) > V yR({x}7(y))). 
The latter will be written A op V yR(a(y)) in #,(T). 

The main step is to formalize the unsecured-sequences argument for the 
II;-completeness of 7; cf. § 1.4. Given X(s) (which may contain numerical 
parameters) the number ug is associated (uniformly in the parameters) 
with {ug} total and {ug}(s) ~ 0 Asi[s, cs > ~ R(s,)]. Let 


Secy(s) = T(us Is); (7) 


this expresses SR-securability of s. With this identification, the inductive 
definition of Rt-securability: 


[H(s)a A sls) es > ~ Hls,)] v A y Secg(s * <y>)] > Secg(s), (8) 
and also proof by induction: 


{AsLR(s)A Asi[sy os > ~ Rsy)]v AyB(s * <y>)] > BS} 
> As[Secy(s) > B(s)], (9) 


are both derivable in ID;. Applying (9) we get 
ID; + Secg(s) > A o%p[@(Lh(s)) = s > V yR(a(y))]. (10) 
On the other hand, ~ Secg(s) > V y ~ Seca(s # <y>). Thus if ~ Secy(s,) 
we get ain It with Ay ~ R(a(y) and &(LA(s,)) = s,, by 
a(x) = (uy) ~ Secy(a(x) * <y>) = (uy) ~ Tum [(@(x) * <y>)) 
for x > Lh(s,). Formally, 
IDy + Adg[X(Lh(s)) = s > V yR(&(y))] > Seca(s). (11) 


In particular, A ag, V ySt(@y)) > As Secy(s). Thus the translation of (BI)~ 
is provable in ID; from (9) and (11). To obtain (IIj-CA)~, consider 
/\ oy V yR(x, &(y)); here ug depends on x, ug = f(x) with prim. rec. 
(7), (10) and (11) yield 


IDp FE Adm V yR(x, a(y)) Tf (x)). (12) 
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This proves V zmAx[x€' zo Adm V yR(x, &y))]. The fact that this is 
a T-model also follows from (12). 


It does not follow from these results that ID, has an w, T-model in ID;. 
The obstacle is that the relative arithmetic CA does not hold in Nt. However, 
we can modify the proof of theorem 2.1.2 to obtain the following. 


THEOREM 2.1.3. ID, is w, T-reducible to IDr. 


Proor (sketch). Let S be the subsystem of (179-CA)+(II}-CA)~ con- 
sisting of all instances of 


VXAx[xeX @ (x)] 


where % has no set parameters and no nested 2nd-order quantifiers; in 
other words % is arithmetical in some formulas of the form / Y(Y). The 
proof of theorem 2.1.2i (esp. (1), (5)) also shows that ID, has an w, T-model 
in S. It thus suffices to show that any finite subsystem S, of S has an 
w, T-model in ID;. This is obtained by taking St(x) to define the set of all 
numbers x of functions recursive in T (nth jump of 7) for sufficiently 
large n. 

These model-theoretic arguments do not give the proof theoretical 
equivalence (2.1.1) of ID, with (BI). 


2.2. Relationships between ID,, and (II{-CA)+(BI). The main results 
here are simpler to state because (12-CA) is derivable from (IT{-CA). 


THEOREM 2.2.1 

(i). ID, has an w, T-model in (II;-CA)+ (BI). 

(ii). (7,;-CA)+ (BI) has an w, T-model in IDz,,. 

Proor. (i) By definition, ID, = ID. for < = <,;; this has the same 
theorems as ID. using the ordering < of w. Let S = (I1j-CA)+(BI). 


For each P™ of ID,, the definition of P}(x) in &, is obtained from (1.3.2) 
as follows. Let 


Q(X, vy) = AY{AxLMY, 2((z)1 < VA(Z)o € Xe.) VY) > XE Y] 
= i 1G) 


which expresses that X, is the smallest set satisfying the inductive definition 
from (X,>,<y. Then let 


Feues) = VxX[Ayi < yO(X, yi)axe X,], (2) 
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(IIj-CA) + AyVZAx[x eZ @ PY(x)’]. (3) 

This follows from 
(Ij-CA)t AyV XA yi < y&(X, yx), (4) 
which is proved by induction on y. We also have unicity of such X (up to y): 

(1T{-CA) F Ny, x, XTAY < y&(X, Vi)AG(X’, y1) ae Xy = XI. (5) 
It follows from these that the translation of axiom (1.3.1i) of ID, is provable 
in (ITj-CA). 

In contrast to the proof of (2.1.1), we cannot in general prove 
VZAx, y[Xx, y> € Z  P}(x)'] in (H{-CA). Here we must use (BI) to 
get the translation of (1.3.1ii), which is (equivalent to the universal closure 
of ) 

Ayi Sy O% yi)a(xe X,) > 

(A uLU(B’, 2((Z)1 < va(Z)o € Xi,), 4» y) > B(u)] > B)}- (6) 
This is provable in (BI) by (1) and lemma 1.6.3. Thus the translation (2) 
determines an w-model of ID, in (JTj-CA)+(BI), which is easily checked 
to be an w, T-model. 

For (ii) we formalize the argument that M = Weenee forms an 
w-model of (I1j-CA)+(BI), since Rc™*! is a basis for predicates 
AaV yR(&(y), B(y)) where f is recursive in T,. Formally, let St(x) be 
the formula AyV z({(x)o}7@1(y) = z) of Y,(%,) and (ye’x) be 
({(x)o}7+(y) ~ 0). This determines a translation of &, preserving 2, 


and relativizing the 2nd order quantifiers to Jt. Given x such that W(x) 


write B,, for Ay{(x)o}7™ (y). 

The proof extends the argument for (2.1.2ii). Consider a formula 
A dy V yR(E(y), B.(y)). A number uj of the tree of unsecured sequences 
of 9i(s, B.(LA(s))) is associated with R, uniformly in x. Let 

SeoS(s) = Tey, +:1(tsh I's). (7) 
Proof by induction on Secy follows from the inductive description of 
Tix),+1» and hence 


IDz,, | M(x) A Secy(s) + A %pl&(LA(s)) = s > V yRE(y), By) (8) 
By direct argument 
Dz, M(x)A ~ SecH(s) > V af Re7™™.**(x) AX(LA(s)) = s 
ANy ~ Ray), BO), (9) 
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and thus 


IDz, + A Bar V of A On V yRE(y), BYY)) > Th 1(u)]- (10) 
The translations of (I1j-CA)+ (BI) follow as in (2.1.2ii). 


§ 3. Relationships between ID,, ID, and subsystems of analysis 


3.1. Relations between ID,, ID.,, (HI 1-CA)<, and (BI). These relation- 
ships are stated for v < é), using < = <,,. All of the results can be 
generalized to any prim. rec. well-ordering < by adding the appropriate 
instances of TI(<) to each system. 

Let ja] = win <, a#0. X isa hierarchy based on the hyperjump up to 
a (or «) if 


(3.1.1)(i). for each b @1 <a, X,q, = T™, 


and 
(ii). for each b <a with Lim(b), xe X, (x), < bA(X)o € Xe, 


By (1.4.1), the equality in condition (i) is expressed in Y, by the formula 
A x[x € Xpq1 7 A Y{A y[Ur(X, X4,¥) > ye ¥] > xe Y}]. Let Ha, X) 
be the formula of &, expressing (3.1.1i, ii); for fixed a this is written 
§,(X) or ,(X). Let , be the sentence 


Cnr Zz XID. Ax, = ZI. 


> expresses that for each Z, T” exists. For «, B21, (ADs > De+g) 
is derivable in (I79-CA). 

By (JTj-CA)<, for v a limit number is meant the system with axioms 
, for each « < v. (IT}-CA)<, is equivalent to 92, which is provable in 
(11;-CA). Conversely, if 2 is assumed we can prove the completeness of 
T’ for predicates A\aVyR(&(y), Z). Hence the axioms of (II}-CA) 
follow from §,. Thus 


(3.1.3) (T{-CA)<,, and (IT;-CA) have the same theorems. 


The results of this section were obtained in collaboration with Friedman, 
who introduced the systems (J7;-CA)<, and established, in particular, that 
(4}-CA) is a conservative extension of (I7{-CA) <,, for 113 sentences ([4]). 
This subsection considers relationships involving (7;-CA)<,andID,, ID<, 
for arbitrary v satisfying certain closure conditions. In § 3.2 we consider 
the relation between these for v = w® and the comprehension rule (43—CR). 
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The general pattern of the results is summarized in § 3.3. 
The arguments are extensions of those of §2; only the few essentially 
new points involved are indicated. The following generalizes theorem 2.2.1. 


THEOREM 3.1.4 

(i). For each v < &Q of the form y* @ with y > 0, ID, has an w, T-model 

in (IT;-CA) <,+(BI). 

(ii). For each v < & of the form vy = w’ with y > 0, (I1;-CA)<,+(BI) 
has an @, T-model in IDr,. 

Proor. In (i), only {,}+(BI) is needed. For (ji) we use the fact that 
Jjsj<yRe*® is a model for (17;-CA)<,+(BI). For suppose Z € Re’* with 
|b| = B < v and suppose |a| = a < vy. A hierarchy X up to a with X¥, = Z 
can be found recursive in T,,, where |b @ al = B+a; but a, B < w’ 
implies (8+«) < @’. This argument is formalized in ID;,. 


COROLLARY 3.1.5. Suppose v < &Q is of the form vy = @” with limit y. Then 
(IT;-CA) .,+(BI) is @, T-reducible to IDy, <,. 

Proor. Any finite subsystem S of (17j|-CA).,+(BI) is contained in 
(I;-CA) <,, +(BI) for some v, = ”! with 0 < », < y. 


THEOREM 3.1.6. Suppose 2 < v < &. Then 

(i). for « < v, ID, has an w, T-model in (IT}-CA) <y; 

(ii). IDz, is @, T-reducible to (ITj-CA)<,. 

Proor. (i). The argument is similar to that for (3.1.41) (extending that 
for (2.1.2i)) but is simpler. A model for ID, can be defined from any X 
such that ,(X) (with ¥) = 9). (BI) is not needed here because the transla- 
tion of any formula %(x) is arithmetical in such an X; the work is taken 
over by (I72,—CA). 

(ii). This is a corollary of (i). 


Summarizing, we have 


THEOREM 3.1.7. Suppose v < & is of the form v = q’ with limit y. Then the 
theories (ITj-CA)<,, (IIj-CA)<,+(BI), IDz, and ID; <, are w,T- 
equivalent. 


The condition that y is a limit number cannot be omitted here since, by 
[8] § 8, the consistency of (I7;-CA) (in fact the full reflection principle) 
can be proved in (17;-CA)-+ (BI). 
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By [4] we have 
CoroLary 3.1.8. (4,-CA) = IDyz, <,- 


3.2. (43-CR) and (IIj-CA) <,,0. A formula of Y, is said to be essentially 
IT; (113) if it can be brought to a prenex normal form in which every set 
quantifier is universal (every universal set quantifier precedes every existen- 
tial set quantifier); it is said to be essentially 2; (22) if its negation is essen- 
tially 17; (113). The 4;-comprehension rule (n = 1, 2) is the rule of inference 


G21) —Ax15G) + 6) 
VX Ax[xeX @ S(x)] 


where % is ess. II, and © is ess. 2, (parameters permitted). (4}-CR) 
denotes the system obtained from (IT9-CA) by adjoining (3.2.1) as a basic 
rule. The usual reduction of ess. 11} formulas to I}-normal form requires 
choice axioms which are not available in (4,;-CR); this explains the use 
of the more general classes of formulas. 


THEOREM 3.2.2 

-(i). (43-CR) is @, T-reducible to (IT{-CA) <a.- 

(ii). (43-CR) is a conservative extension of (ITj-CA) <0 for IT, sentences. 

Proor. (i). The proof formalizes, in (IT{-CA)<, 0, the following model- 
theoretic argument. Let M, be the collection of sets recursive in T, for some 
B <a. M, = Jg<qMz for limit «. A X*(...) abbreviates A Xy,(---), and 
similarly for V X%(...). %* denotes the result of relativizing every set- 
quantifier in § to M,. In this notation, the fact that Rc™* constitutes a 
basis for each predicate Yj in some Tz, B < a, is expressed by: 


IK DLN OR, OS Ne DC ea ak (1) 


We next claim that with any ess. 2} formula G(X) can be associated ©, (X) 
of the form V Y9(X, Y) such that for any « 


\ X*[ OX) > G(X] 0 A XTOUX) > G(X). (2) 


Suppose, for example G(X) = AkV YAmV ZB(k, m, X, Y,Z). Take 
G(X) = VY, ZAk, mB(k, m, X, Y;,, Z,, m). The second implication is then 
trivial. For the first, given any «, enumerate M, = {Wo,..., Wr,..-} 
arithmetically in T,. Then @(X)o AkV yAmvV zB(k, m, X, Wy, Wr); 
if this holds we can choose y = f(k), z = g(k, m) where /f, g are arithmetical 
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in T,. The sets Y = {{x, k>: x € Wygy} and Z = {<x, (k, m>): x € Wik an} 
give the first implication in (2); in fact, since these are recursive in T, +, 
we can take @7*?(X) instead of &,(X) there. 

It follows that if 8 is a limit number, B < a, and §(X) is ess. 17} then 


\ X*TS(X) > B(X)]- (3) 


Suppose, for example % is of the form / Y@(X, ¥Y) where © is ess. Yj. 
Assume X is in M, and that /\ Y*@*(X, Y). Given Yin Mg, it is to be shown 
that G’(X, Y) holds. By (2), © ,(X, Y) holds with © (X,Y) = V ZX, Y,Z). 
Let y < f with both X, Y in M,. By (1) Vz" "Wy, 4) andi tence 
G(X, Y), giving G(X, Y) by (2). 
The main step is to show the following: 
Suppose n > 0 and that a sentence % has been proved in (43-CR) 


using at most (n—1) applications of the 4}-comprehension rule; (4) 
then %* is true for every « of the form « = w"*6 with 6 > 0. 


This is proved by induction on n. It is sufficient to show that if §(x, X), 
G(x, X) are ess. IT}, 23 resp., without other parameters and 


AX? A x£8"(x, X) G(x, X)] (5) 
holds for every B = w"- A, 4 > 0, then 
A X°V Y*Ax[xeY — 8x, X)] (6) 


holds for every 2 = @"**6 with 6 > O08 Wiite a —w -(@ 6) ee 
is a limit number with w° 6 =lim, 4, for some sequence of 4, > 0. Each 
B, = o"* A, is also a limit number since n > 0. Then any X in M, belongs 
to M,,, for some k. By (3) and its dual, both 


A x[3%(x, X) > B(x, XY] and = AxfG**(x, X) > G(x, X)] 
hold. Then by (5) 
A xEB%(x, X) F(x, X)]. 
(6) follows from this and 
A X®*\y ¥* Ax[x € Y <— SP*(x, X)]. 
Let S be any derivation in (43-CR) containing, say, (m—1) applications 
of the rule (3.2.1). (4) can be formalized to give an w-model of S in 


(IT}-CA) <m+1. AS one proceeds by induction onn < m, it is only necessary 
to consider those a = w"+ 6 with a < w”™. This proves (i). 
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The entire argument can be relativized to any initial set Z. Write M2 
for the corresponding collections; Z is in M% for any «. If (4}-CR)+ 
AZV YU(Z, Y) then for some n, (I1{-CA) <4 proves 


(A ZAX mean V Yuz, U(X, Y) 
and hence \ ZV YU(Z, Y). 
To complete the proof of (ii) it must also be shown that each axiom of 
(IT{-CA) <o is provable in (43-CR). Note that for any « < &, |a| = a, 
Ay SaQy, Xa DY, X') > Au < W(X, = Xi} (7) 


is provable by transfinite induction on <,. It is sufficient to show that 
for each n < w, 


(ACR en No AX ZI. (8) 


Let |w,| = w", and let o represent ordinal product on <. Suppose (8) 
holds for n; then also 


Ay AX{S(y, X) > VY[Yo = XA L(y @ W,, Y)]} 
and 
NZ EX SO(Weoz, CA Xo — Z| (9) 


are provable in (43-CR). Let $(x, Z) be the formula 

Vy, 2V X{O(W, oz, X)AX, = ZAy <W,9ZAxEX,} 
and W(x, Z) the formula 

Vy, zA X{Q(W, 02, X)AXyp = Z> yp <W,0zaxeX,}. 
§(y, X) is arithmetical in an ess. IT; formula, so is both ess. I} and ess. 23. 
Thus %(x, Z) is ess. 2; and W(x, Z) is ess. 7, ; moreover 

AZAX(S(x, Z) > Ox, Z)] (10) 

is provable in (43—CR) by (7) and (9). Then so also is \ ZV X, \x[xeX, oO 
a(x, Z)], which immediately implies (8) for n+1. 
CoroLiary 3.2.3. (43-CR) and IDz, <ow are w, T-equivalent. 
It is worth noting that the proof of (3.2.2) also gives the following 


THEOREM 3.2.4. (43-CR) is closed under the 3}-rule of choice. 
PRooF. Suppose, for example 


(A/-CR)P AxV XA YU(x, X, Y). 
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By (4) above there is « = @” such that 
CUO ees Na VOC Es 2s, Ae 


Using the enumeration of M, arithmetical in T, (hence recursive in T,,,) 
we derive 


VG A AIK ONS ZR 
Since a is a limit number, the basis result (1) gives 
AY OU(x, Ze YN YO Za) | 


for Z in M,. Hence V¥Ax/A YU(x, X,, Y) is derivable in (17{-CA) <q. 
and in (43-CR). 


3.3. General pattern. Consider the usual jump (not hyperjump) operation 
and write ?(X) for 


X is a hierarchy based on the ordinary jump up to a. 


Let (JI9-CA), be the system with axioms \ ZV X[2(X)AXo = Z] for 
each a < y. It follows from Kleene [5] that (7 1-CA)<o+y is equivalent to 
ramified analysis with levels B for each B < y. Write (I7j-CA) for (IT?-CA) 
or equivalently (J78,-CA). 


G3.D@)(H/-CA)2 (i, -en 

(ti). (I -CA) ae eR?) 

(iii). ({-CA)<,, = (4}-CA). 
(i) is trivial; (ii) is fiom [1], 6.14 (using the above remark on ramified 
analysis) and (iii) from [3] Ch. II.5. By the results of [4] and this section, 
the same pattern is followed when we increase the superscripts on the left 
and the subscripts on the right by 1: 
(3.3.2)(i). ({-CA) 2, = (II{-CA) 

(ii). ({=CA).,6 = (45-CR) 

(iii). (IT!-CA) <,, = (A}-CA) 
As a matter of fact, these results and their proofs were suggested by the 
previous ones. Though the proof of (ii) (i.e. (3.2.2)) simply generalizes 


my proof of 6.14 sketched in [1], this possibility was noticed only recently 
by Friedman. It should be noted in addition that (3.2.4) corresponds to 
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[1] 6.27, according to which (4j-CR) isclosed under the Zj-rule of 
choice.* 

There is an important difference between (4j-CR) and (4{-CA). While 
the jump hierarchy up to ® actually is an w-model for (4j-CR), the 
hierarchy up to é) is not an w-model for (4{-CA) since the least w-model 
consists of all hyperarithmetic sets. A corresponding remark applies to 
(43-CR) and (43-CA). 


§ 4. Value of reductions to systems ID,. The remarks below suggest definite 
technical advantages of these reductions and some limitations of their 
foundational significance; some additional technical advantages appear 
in the discussion in § 5. 


4.1. The main results above yield conservative extension results for 
provably recursive well-orderings and ordinal functions as follows: In 
each case we have 8S, = S,, where S, is a subsystem of analysis and S, is 
one of the theories ID;, or IDr, <,. If S, + WF(R) then we can find ee T 
with *% provably embedded in the tree of e and S,+ 7(é), so S, and S, 
have the same provably recursive well-orderings. But S, = S, also implies 
that for f recursive, S,, S, prove the same sentences of the form 


i xe XL (1 Ans A LX) > Tf(%1,-- + X,)))- 


This extends to any theory S, which is a conservative extension of S, for 
IT, sentences. 

Since the ID, are very naturally associated with ordinals, it is not un- 
reasonable to expect to be able to develop a theory of their provably 
recursive ordinals and ordinal functions. This is obviously so for ID, 
(cf. {2]). It is not obviously so for ID, for v > I since the natural orderings 
associated are non-recursive; however, some collapsing argument may 
perhaps be possible. 


4.2. Evidently the familiar classical interpretation of g.i.d. is not relevant 
to the foundational significance of the reductions, because the existence of 
least sets (under suitable conditions on the defining formula 2) is derived 
from the comprehension axiom or a theory of ordinals. 


* (A — CA) = (2, — AC) = © — pe) by [3] Ch. 5, and (4}—CA) = (23;—AC) = 
(C35 —JDYC) by formalization of the Kondo-Addison theorem as verified by Mansfield; 
however, the arguments in this case are quite different. 
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Prima facie the reductions become candidates for constructive consistency 
proofs because of the following observation of Kreisel [6]. 

Let ID?(2) be the system ID,(2) with the logic restricted to be intuition- 
istic, except that only derivability of the monotonicity condition (1.2.4) 
on 9 is required,* and let ID? be the union of ID?(9) for such %. Then 
ID, < ID). 

However, Kreisel points out (in [8], esp. pp. 344 and 352, and in [9], 
§§ 4,5) that for v > 1, the principles used to justify ID) with (the usual 
interpretation of) intuitionistic logical operations also justify the whole 
theory of species of natural numbers, where the reduction of full classical 
analysis is immediate. Therefore the special significance of these results for 
constructive foundations of analysis is open. 


§ 5. Connections with the literature. We consider work by Tait and Takeuti. 
It so happens that in both cases the principles of ID, enter into their work 
and also variants, that is, formally quite different principles of inductive 
definitions, are considered. 


5.1.1. Tait gives consistency proofs for (4;-CR) and (2;-AC) by cut- 
elimination arguments in infinitary propositional systems PL® and PL”, 
resp. ((12], §§ 4,5). The formulas and derivations of PL” are represented 
by trees indexed by members of T, for « < v. The same cut-elimination 
arguments apply even more simply and more intelligibly to the theories 
ID;, <,, by assigning to every sentence % a sentence %}* of PL” with 
(Ti))* = \X/serii = a). 


5.1.2. The 2nd-order theory Ind of [12] § 1 has constants for set func- 
tionals Zy and axioms of the form 


A x[U(fu(X), x, X) > x € fa(X)] 
and 


A x[U(B, x, X) > B(x)] > Ax[x € fa X) > B(x)], 


where FY is positive in 2(Y, x, X). It can be seen that Ind is w, T-equivalent 
to ID, by the same arguments as in 2.2. 


5.2.1. Takeuti [15] has given a consistency proof for (JIj-CA) by trans- 
finite induction on the ordinal diagrams O(w+1, w*) applied to a prim. 


* The argument in § 1.2 reducing to %1,(P, x) with positive P does not hold intuitionisti- 
cally. 
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rec. predicate. The principles of g.i.d. enter this work implicitly, in the 
following sense. At the present time, the most elementary proof of the well- 
foundedness of O(w+1, w*) uses ID,; the method is by an extension of 
the proof by Kreisel (review Zentr. f. Math. 106 (1964) 237-238) and by 
making use of [13]. 

It seems likely that the following converse holds; a proof of consistency 
of ID,, and hence of (IIj-CA), should be given using induction on 
O(@+1, —); moreover, this should be much simpler than [15]. 


5.2.2. In [14] and [15] ch. 4, Takeuti considers 2nd order systems with 
additional symbols P(x, y), P(x, y,W) resp. and axioms of the form 


(1) P(x, vy) <> B(2((2)1 < y A P((Z)o» (2)1))s % Y)s 
(2) P(x y, VY) 8 (2((Z)1 < y A P(Z)o» (2) 7), Ys M7); 


where < is a given prim. rec. well-ordering, %(X, x, y) resp. @(X, x, y, Y) 
is arithmetical in 1{-formulas, and, in (2), VY ranges over arbitrary ‘varie- 
ties’, i.e. abstracts G(x) (written {x}G(x) in [15] for any formula G. Thus, 
in contrast to our systems ID, these axioms contain also 2nd order variables. 

We establish relations between the systems in the text and, for reasons 
to be explained in a moment, somewhat modified variations of Takeuti’s 
systems. 

Let « = |<|, and suppose (for simplicity) that a is of the form w’. Let 
S.. be obtained from schema (1) by adding TI(<) and (JT2,-CA); P is 
treated as a 2nd order constant symbol. Then 


(3) (i). Sz. is w,T-reducible to IDz,. 
(ii). IDz, is w,T-reducible to S.. for suitable choice of &. 


(It is understood as in § 1.3 that ID;, is described by reference to < and 
contains TI(<).) The proof of (3i) is obtained by formalizing the construc- 
tion of an w-model of S. with P recursive in T,. 

Let S2 be obtained from schema (2) by restricting Y to be a set variable, 
and adding WF(<), (BI) and (JJ;}-CA); P is now treated as a functional 
syimbolionAmen (a )oa(e),1), Then 


(4) (i). SZ is w,T-reducible to IDy,..., 
(ii). IDy,.,, is @,T-reducible to S% for suitable choice of §. 


The proof of (4i) goes via an w,7-model of S2 in (IT{-CA) <,..+(BI) (in- 
cluding TI(<)) and the generalization of (3.1.4ii). 
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These results can be generalized to systems involving iterations of the 
schemata (1), (2) considered in [14], [15]. 

Concerning the modifications, it seems that TI(<) must have been in- 
tended (though it was not explicitly included) since it is difficult to see how 
the schemas (1), (2) can be applied non-trivially without it. In any case, 
inasmuch as the concern in [14], [15] is with consistency proofs, if one can 
get a reduction for an extended system by the same principles, so much the 
better. With regard to the restriction to set variables in (2), no explanation 
is given in [15] for the substitution of £@(x) for Y in the atomic formula 
P(x, y, Y), and it is not clear exactly what was intended. A form of (2) 
is derivable in SZ for those abstracts “ = &G(x) which can be proved to 
define a set, e.g. when G(x) = G,(x, P) with G,(x, X) arithmetical in I7}- 
formulas. 
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BROUWER’S BAR THEOREM AND A SYSTEM OF ORDINAL 
NOTATIONS 


HARVEY GERBER* 


In [3] Gédel proves the consistency of arithmetic by using computable 
functionals of finite type. Spector [6] extends Gédel’s proof to a consistency 
proof of analysis by introducing a new definition, bar recursion, and a 
corresponding principle of proof, bar induction. Bar induction is a general- 
ization of Brouwer’s bar theorem [1] (bar induction of type 0) to higher 
types. The problem still remains of obtaining a proof of the consistency of 
analysis by using the constructive theory of ordinals. 

In § 1 we construct a system of ordinal notations based upon the work 
done in [2]. This system of notations has been used by Wm. Howard 
(unpublished) to prove the consistency of the formal system obtained by 
adjoining the axiom of bar induction of type 0 to elementary intuitionistic 
analysis. 

In § 2 we prove the well-ordering of any proper segment of our notations, 
but not the whole segment itself, using Brouwer’s bar theorem formulated 
in the form of TIp, ‘the axiom of transfinite induction’ for decidable rela- 
tions p. 


§ 1. A system of ordinal notations. The terms 0, Q°, Q°[a, a] defined below 
can be interpreted as the ordinal numbers 0, 1, ga*a* respectively (of [2]), 
provided that the terms «, a are interpreted as the ordinal numbers «*, a* 
(also of [2]). We recall that 0 is any normal function whose range consists 
of limit ordinals. 


DEFINITION 1. Terms and class Cl of terms of the system T. 
1.1. Ois a term and C/(0) = 2. 


* The author is grateful to the National Research Council of Canada (Grant # A-3723) 
for financial assistance. The author also wishes to thank Professor William Howard for 
his many helpful suggestions. 
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1.2. If « is a term then 927 is a term and 


> itF @ == O. 


3 otherwise. 


CU) = | 


1.3. If a, B, 6 are terms and C/(b) = 2 then Q*[f, 5] is a term and 


2 ifae=0, 
3 otherwise. 


cuStTB, bY = | 


1.4. Ifa, B are terms, then « + f is a term and 
Cl(a + B) = max {Cl(a), Cl(p)}. 
1.5. If « is a term then 270 is a term and C/(Q*0) = 2. 


Intuitively, C/(a) = 2 (resp. Cl(~) = 3) says that the ordinal which « 
denotes is in the second (resp. third) number class. This concept is needed 
in order to provide the restriction in 1.3 above which corresponds to the 
restriction in [2], that the function @f is defined only for arguments b 
such that 5 is in the second number class. We call the terms of the form 
Q* or Q*[B, b] the principal terms 

Let *, < > be defined as in [4]. 


DEFINITION 2. We define the sequence of summands of a term a, written 
S(a), as follows: 

2.1. If x is a principal term or 0 then S(«) = <a). 

2.2. lia — f sey thents(2)i—5(p een 
If S(x) = <a,,...,%,> then we say that each a, (1 < i < n) is a summand 
OL X. 


DEFINITION 3. Degree of terms: 
Sey 0: 
3.2. D(Q*) = D(a) +1, 
3.3. D(Q*[B, b]) = D(a)+ D(p)+ D(b)+1, 
3.4. D(a # B) = D(a)+D(B). 


DEFINITION 4. Let ¢ be a principal term. We define the exponent and the 
coefficient of t, written E(t) and K(t) respectively as follows: 

4.1, E(Q*) = a, K(Q*) = 9° 

4:2, E(B; bi) =a, 6218, be Pere 
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DEFINITION 5. We define the relation C(x, t) read ‘x is a constituent of t’ 
as follows: 

51. lit = 0 thence) ith — 0, 

52) Leu? — 1) + wWitete ?,1S elther a principal term or @ for 
l <i <n. Then C(x, t) iff x = K(#,) or C(x, E(t,)) for some / if t, 4 0, or 
fa — Unit, — 0) 


DEFINITION 6. We define the relation P(x, «, a) read ‘x is a part of Q°(a, a)’ 
as follows: 
P(x ia) aeille Clea) VY x =a. 


Before defining the <-relation and the =-relation we introduce the following 
abbreviations. We abbreviate a<bva=b by a< b, —(a=)5) by 
@obunda=—b by b> a letr—7,4 ... + ¢, where cach 7#, is either 
a principal term or 0 (we call such a representation the normal form represen- 
tation (n.f.r.) for t) and at least one of the ¢; is different from 0. Let L(t) 
denote the summand 1; 4 0 of t, of largest index i, such that E(t;) < E(t,) 
for all j (1 <j < n) where t; 4 0 and such that K(t;) < K(¢,,) for all m 
(1 <m<n), where E(t;) = E(t,,). We write (a, a], 1, Q etc., instead of 
Q°[a, a], 2°, a”, etc. 

Assume that a #0va#0. Let DP(a, a) denote the term a if a #0. 
If a =0 and K(L(t)) # 1 or E(L(t)) = 0 then DP(a, a) denotes K(L(t)). 
If a=0 and E(L(t)) #0 and K(L(t))=1 then DP(«,a) denotes 
DP(E(L(t)), a). We write x = DP(a, a) and say ‘x is the distinguished part 
of (a, a)’. 

One final abbreviation. Assume thata 4O0VQ < a. Lett =1t, +... +1, 
be the n.f.r. for ¢. If a 0 then we write x e ND(zt, a) iff C(x, t). Ifa = 0 
and K(L(t)) # 1 and Q < t then we write x e ND(t, a) iff C(x, t, #... + 
+ L(t)* +... + t,) where L(t)* is obtained by replacing L(t) = Q°[f, 5] 
by 2%. If a = 0, E(L(t)) = 0 then we write x e ND(t, a) iff C(x, t, +... + 
+ L(t)* +... 4+ ¢,) where L(t)* is obtained by replacing L(t) by 0. Finally, 
if a=0, E(L(t)) #0 and K(L(t)) =1 then we write xe ND(t, a) iff 
Cee l(t) eet i) where L(f)* is obtained by replacing 
L(t) by 0 or xe ND(E(L(0)), a). We say ‘x is a non-distinguished part of 
(a, a) if xe ND(q, a). 

The following definition is a definition by induction on the total number 
of components and parts of the two terms involved. Hence this definition 
gives an effective ordering of the terms. 
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DEFINITION 7. Inductive definition of the =-relation and <-relation on T: 
7.1. 0 = 0,0 < a for all a 0, and 20 = 0 for all «. 
7.2. Let t, = Q* and t, = OF. Ifa = B, then t, = fy. 
7.3. Let — @' le... @,), tp = @ leone | with = 2p: 
7.3.1. If x, = a, and a, = a, then t, = fp. 
732. 6,24, andi|@j.c.— WEG) and 


Ax[xeND(q,, a) ~ x < [o2, @2]), 


Ueno; wate 
7.3.3. Ifa, < a; and [a,,4@,))= DE@o ee, and 


Ax[x €ND(a2, a.) > x < [o,, a,]], 


then t,; = f,. 

7A. Let S(a) = <o,,...,0,> and S(P}=<Bi.-2.,6,>. Wf k= aaane 
there is a permutation (/,,...,/,) of (1, ..., &) such that «; = B,, for each 
i=1,...,k, thena = Bp. If k <n (resp. n < k) and there is a permutation 
(is... 1,) of (1, nm) (respia) permutation (15), ) of (eae oy 
such that a,, = 8, for 1<m<k and $8, =0 for k <m<n (resp. 
a1, = Bmforl<m<nanda, =0forn<m<k), thena = p. 

7.5. If t,, 4, are principal terms and 1f £(7,;) =< £(2,), theny7p-pe 

T.6Rlf tf, = 2, 1 O' [peal andtive:— ethene etry. 

7.7. Let t; = Q%[o,, @,], t2 = Q*[o,, a,] with a = fp. 

7.7.1. Ifa, = a, and a, < a,, thent, < fy. 

7.7.2. lf wy ay and A. x{ P(x, 0,4) > x < [a5, a5)}, thence 

7.1.3. TE V x{ P(x, op, 65) Alley, alia; sthen tec ioe 

7.7.4. If Vx[xeND(a2, a.)Ax = [o,, a,]], then t, < fy. 

7.8. Let B be a principal term, then fies. pb) = py eulane 
ede ee ee 

7.9. Let «, B,,....6,, be principal termse7 > elie = b. forsomer. 
(l<k < m), thena < B, #... + Bn. 

7.10. Let ase, 4. eee, 8 — Bae 8 be the inet eien 
@, BP resp. where n > lo) = WIE there existia yf <7) (cee) 
such that «, = f, and 


Oy te FE Oy AE Oy TE ee. EO < By HE... HE By HH Bea H-.. HB 


then « < B. 


Associativity and commutativity follow trivially from def. 7.4. We see from 
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def. 1.3 that b < Q in the term Q*[f, b]. The following theorems show that 
= is an equivalence relation, the <-relation is compatible with =, and 
that the system T is linearly ordered. 


THEOREM 1.1. Let a and b be any terms; then 
(i) a = aand —(a < a). 
(ii) Ifa = b, then b = a. 
(iii) One and only one of the following hold: 


a<ba=b,b<a. 


Proor. We prove (i) by induction on D(a), and we prove (ii) and (iii) 
by induction on D(a)+D(b). 


THEOREM 1.2. Let a, b, c be any terms, then 
(i) a= bab=cr7a=e, 
(ii) a< baAb<cra<e, 

(ili) a= bab<ca<e, 

(iv) a=bac<b>c<a. 

Proor. We prove the conjunction of (i)-(iv) by induction on 
D(a)+ D(b)+D(c). Namely, if a’, 6’, c’ are any terms such that 
D(a’)+ D(b')+ D(c’) < D(a)+D(b)+D(c) and if a’, b’, c’ satisfy the 
premise of (i), (ii), (iii) or (iv) then the conclusion follows. Let us assume 
Pidtee — |[0;,¢,b00 — [o5,4,|, ¢ = |o3,¢@,|, where a; =o, a, = 4, 
dt, < 43, [«3,a;] = DP(a,,a,) and Ax[x¢ ND(«2,4@,) > x < [a3, a3]. 
By the induction hypothesis «, = «, and «, < a; imply a, < a3. Since 
a, =a, and a, = 4), it is easy to see that DP(«,,4,) = DP(a,, 4,), 
and that if ye ND(«,,a@,) with y # 0, then there is an x e ND(ap, a3) 
such that y = x. Thus by the induction assumption DP(«,, a,) = [a3, a3] 
and A x[x ¢ND(a,,4@,) — x < [#3, @3]JJ. Thus [#,,4,] = [«3,4,]. The 
other case are similar. 

We can now easily show that if b < a # 1 then d < a. 


THEOREM 1.3. A x{P(x, «, a) > x < [a, a]}. 

Proor. By induction on D([x, a]). Let p be the maximum part of (a, a). 
We can assume that p = DP(qa, a) and all other parts of (a, a) are less 
than p, for Otherwise » <a (0, a). Let p = [f, 6). i a < £, then [«, ad] = 
= [f, b]. If a= 8B and a= [f, b], then b < [f, b] by the induction hypoth- 
esis and hence [f, b] = [«, b] < [z, (8, b]] = [o, a]. If a =O then pisa 
constituent of «. Since « = B there is a constituent g of Bf such that p = q. 
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If b = 0, then p = [«, a]. If b > 0 then [f, b] > g = [f, b] which contra- 
dicts theorem 1.1. 

Now assume that 8B < a and a =0. Let P(r, 8,b) with r > [a, a]. Let 
a* be obtained by replacing p by r in «. Then, since [f, b] > r, we have 
r < [«*, 0] < [«, 0] <r, a contradiction. If there is an re ND(f, 5) such 
that r = [a, a] then, since [f,b] > r, we have r < [a*,0] < [«, 0] =r, 
again a contradiction. If P(r, 8, b) implies r < [«, a] then [f, b] < [«, a]. 
If re ND(f, 5) > r < [«, a] and p = [«,0] where p = DP(f,b) then 
[x, a] = [f, 6]. The case when a ¥ 0 and f < « is similar. 

It is easy to see that if p = 0, p = 2° Crp — i, 47... 4 tp, theme 
theorem holds. 


§ 2. Well-foundedness. In this section it is useful to alter the definition of 
term by writing Q'(a, +... + ,) for Q'a, +... + Q%a, where 
Cla; =2 (1<i<n). Thus, for example, we write Q#) 4 
+ Q[0, 0] + 1) instead of Q%-*° 4 O70, 0] + Q. If 1 is a summand 
of t, then we write t = « + 1. If t = « + 1 then the term ¢* = « is defined 
inductively in the obvious way. If it is not the case that either t= 0 or . 
t = a + 1 then we write ¢ ¢ Lim. 


LEMMA 2.1 

(i). Suppose [~ + 1,0] > [8,5]. Then there exists a k such that 
g9(k) > [B, b], where g(1) = [«, 0], g(n+1) = [, g(n)]. 

(ii). Suppose [x + 1,a #1] > [8,5]. Then there exists a k such that 
g9(k) > [B, b] where g(1) = [w # 1, a] + 1, g(n+1) = [e, g(n)]. 

Proor. We prove (i) by induction on D([f, b]). It is easy to see that 
a +1 £. Assume that « + 1> 6 and [« + 1,0] > p where p:p is the 
maximum part of (f, 6). By the induction assumption there is a k such that 
g(k) > q, where qg is the maximum summand of p. Thus g(k) > p. If 
a > B, then g(k) > [f, b]. If « = B, then g(k) > b and thus g(k+1) = 
= [x, 9(k)] > [Bb]. 

If there is a part of (« + 1,0) which is greater than [f, b], then there is 
a part of (a, 0) which is greater than [f, b]. Thus g(1) > [f, b]. If there is 
a pe ND(« + 1,0) such that p = [8,5], then there is a part of («, 0) 
which is equal to [f, b]. Thus g(2) > g(1) = [f, db]. 

We also prove (ii) by induction on D([f, b]). 

Let « # 0 be any term. If K(L(a)) # 1 or E(L(«)) = 0, then a(c) is the 
term obtained by replacing the! summand L(w) of « by the term QE4@)¢, 


E XxI A SYSTEM OF ORDINAL NOTATIONS 333 


If K(L(«)) = 1 and E(L(«)) #0, then «(c) is obtained by replacing 
L(a) by 2° where B = E(L(«)). Let ¢ = [a, a] # [0,0], then t{c} = 
[x,c] if a#0 and t{c} = [a(c), a] if a= 0. We see that t{c} is ob- 
tained by replacing the distinguished part of (a, a) by c only at DP(q, a). 


LEMMA 2.2. If t = [a,a] > [B, b] and if DP(«, a)eLim then there is a 
c < DP(q, a) such that t{c} > [B, 5]. 

Proor. By induction on D([f, b]). If « = B and a > b then ae Lim thus 
choose c = b # 1. If « > B and all parts of (f, b) are less than [a, a], then 
by the induction hypothesis there is a c such that t{c} > p where p is any 
part of (f, b). Note that a(c) > B. Thus t{c} > [f, b]. 

Assume that there is a part p of (a,a) such that p > [f,)]. If 
eine a) tncnesctece— al, Die ip ~ DP; a), then set_c— 1. 
If g¢ ND(q, a) and q = [f, b], then choose c = 1 + 1. 


LEMMA 2.3. Let [a, a +1] > [8,5]. If DP(a, 0)¢ Lim then either [a, a] > 
> [B, b] or there is a c < DP(a,0) such that [a(c), [«, a] #1] > [f, b]. 

Proor. By induction on D([f, 6]). If « = B then a + 1 > b and hence 
az b. If a > bd then [a, a] > [f, b] and if a = b then [a(c), [«, a] + 1] > 
> [f, b] with c = 1. Assume that « > f and all parts of (f, b) are less than 
[a, a + 1]. Let g be the maximum summand of the maximum part p of 
[eo Then either je,a| > q¢@ and hence [¢,a] > [8,6], or there is a 
c, < DP(a, 0) such that [a(c,),.[«, a] #1] > q. Since « > B there is a 
C, < DP(a, 0) such that a(c,) > B. The lemma holds taking c; = max(c,, ¢2). 

If there is a part p # a #1 of (a,a #1) such that p > [8,5] then 
eel |[f, Ol) liege a 4 t then |x, a)+ 1 = [ea] 2 [f, b]. Thus the 
lemma holds taking c = 1. Similarly, if p¢ ND(«, a + 1) and p = [f, 5], 
then since a +1 # [8,b] we have [«,a] + 1> [a,a] > [f,b]. Thus 
choose ¢ = I. 

Let « £0 be any term such that DP(«#,0)=1 and a#y +1. If 
E(L(«)) = B # 1, then a[c] is the term obtained by replacing the summand 
L(a) of « by the term Q%c. If E(L(«)) # B + 1 then a[c] is obtained by 
replacing L(«) by Q°"1 where B = E(L(a)). We see that if a > B and c 
is greater than any constituent of B then a[c] > P. 


LEMMA 2.4. Let [a,0] > [B, b] where « #0, DP(a,0) = 1. fay #1, 
then there exists an n such that g(n) > [B, 6], where g(1) = [z[1], 01, 
g(n+1) = [alg(n)], 0]. 

Proor. By induction on D([f, b]). It is easy to see that « # P. Assume 
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a > Band all parts of (f, b) are less than [«, 0]. Then there is an m such that 
g(m) > p where p is the maximum summand of [, 5]. Hence a[g(m)] > 
and thus g(m+1) > [f, 5]. 

Assume that there is a part p of (#, 0) such that p > [f, 5]. Then p is a 
part of g(1) and hence g(1) > [8, 6]. Similarly any non-distinguished part 
of (a, 0) is a non-distinguished part of g(2), thus g(2) > [f, 6]. 


LEMMA 2.5. Let [a,a +1] > [B,b] and let a be as in lemma 2.4 with 
a & y +t 1. Then there is ann such that g(n) > [B, b] where g(1) = [«, a] + 1, 
g(n+1) = [a[g(n)], 9]. 

ProoF. By induction on D([f, 5)). 


Note that lemmas 2.1—2.5 hold when [f, b] is replaced by an arbitrary term 
t*, Simply take S to be the maximum summand of f*, and apply the 
restricted result. 

Before proceeding to a proof of the well-foundedness of any proper 
segment of our notations we need the following definitions. We assume that 
the reader is familiar with the formalization of H (elementary intuitionistic 
analysis) as given in [4]. Let Q be any property, and p any relation definable 
ind. 


DEFINITION 8. Q is progressive with respect to p, written Prog (Q, p), if 
Ax{Aylxpy > Q(y)] > O(>)}- 
The principle of proof by transfinite induction with respect to p up to a, 
written I(p, a), is 
\ Q{Prog (Q, p) > A ylapy > Q(y)]}- 


DEFINITION 9. The relation p is well-founded up to a, written WF (p, a) 
if given any free choice sequence f, if apf(0) then 


Vans [f(n)of(n+1)]- 


Finally by the axiom of transfinite induction for a given relation p up to a, 
written TI(p, a) is meant the axiom scheme 


WF(p, a) > I(p, a). 


We shall be using only one order relation > thus we will write Prog (Q) 
for Prog (Q, >) ete. 

As in [5], p. 29 we can find a one to one mapping W from our notations 
into the natural numbers. Let us assume that this has been shown. In what 
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follows we identify the notation with the natural number given by yw. 
Let 
Tie ze — 
and 


Cer (apy Cee 2, XO ner geste (0 RC ae ors > +1277) 
INotethat, for example, 7)(¢;.2,1)—o, +27 7 ,(¢,,2,0)=a, and 
Mey. y 0) — Ty (ay Oy) 2) 1). We define 

Gi ay, Zara (bly (04, 24 0,, 2%). -2 |. 

ete O (oq 2) denote 
a, #0 > E(L(a,)) > z 
amd Up. (Cin 1,4, 2) denote 
ee = EE I (Ghee, 2) IIA U, (a5, os 2 0,445 2): 
et Pay... .0,. 2, x) denote 
A b{WF(b) — WF(S, (015. . +5 ns Zs X> b))}; 


e.g. P2(a,, %2, Z,x) denotes \ b{WF(b) > WF([a # 277°, b]. Finally 
let Q,(z, x) denote 


Wie ere uU Kan eerie. 2A P(a,,-. . @,,Z, 0) 
=P, Cee ozo) 
and 
a, £ OAE(L(a«,)) = z > K(L(a,)) > x. 


LEMMA 2.6. If 
IWIE(S, (aes ey Zs X, 0) (1) 
and 


A aL WF(a) > {WFS,(a,,..-, %q5 Z) %5 4) 
mae WFS,(a, ye+9Kn, Z,X,4 +E 1)}], (2) 
Pee (Ois . s8q Zy x): 
PRoor. We will get P,(a,,..-,%,,2,x) from TI by showing Prog Ab 


WFS,(%1,-. +» %a» X, 0). In view of (1) and (2) the only case to consider is 
when b € Lim. Prog 2b. WFS,(a,,...; &,, x, &) then follows from lemma 2.2. 
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LEMMA 2.7. Suppose xe Lim, x <'@ and (Ay < x)P,(o,. .- a, fey 
Dien PGs, zee 

Proor. We establish (1) and (2) of lemma 2.6. To get (1) observe that 
the hypothesis implies 


Cpe) WE (SiO ipee poz. v0 ))e 


hence S,(a,,..., %, Z, X, 0) by lemma 2.2. To prove (2) let @xb denote 
MHC mep ec 508 fe IMAC) 
Ox(a + 1) = lim 0y((Oxa) + 1) (3) 
y<x 


by lemma 2.3. Assume WF(a) and WF(0xa) (so WF((0xa) + 1)). Hence 
from (Ay < x)P, (01, .. +5 &,, Z, ¥) we conclude (A y < x)WF(Oy((Oxa) + 
+ 1)). Thus WF(6x(a + 1)) by (3). 


THEOREM 2.1. If Q,(z, 1) then Prog Ax . Q,(z, x) for all x < Q. 

Proor. Q,(z, 0) is obvious. \ x{Q,(z, x) > Q,(z, x 4+ 1)} follows from 
Q,(z, 1) because P,(o;, «22; G,, 2% 4¢ 1) is justee(a ye oe tee eee 
It remains to prove, for x e Lim, x < Q, that Q,(z, x) if (Ay < x)Q,(z, y). 
This follows from lemma 2.7. 


Coroiiary |. Q,(z, 1) > Q,(z # 1, 1). 

Proor. Assume Q,(z, 1). Then A x{WF(x) > Q,(z, x)} by theorem 2.1 
and TI. Assume U,(4,,..50,.2 4 f)) and Pio... 2,0, ei 
we must prove P,(a1,..-.,%,, 2 + 1,1). By lemma 2.6 it suffices to prove 

WES, (615.4 G,. 2-226 0) (4) 
and 


A al WF(a) > {WF(S, (a1, .. +) %q, Z # 1,1, a)) 
= WE(S,(¢;5 senza 1, 1a 2) a) 
From /\x(WF(x) > QO,(z, x)l- U (oie c,, 2 1) and Po. eee 
z + 1,0) we conclude 
Ax[WEG@)Iea Bice. cz, x) (6) 


Hence /\ x[WF(x) > WF(S,(a1, ...; %, Z, X, 0))]. Hence (4) follows from 
lemma 2.4. To prove (5) let g(n) be as in lemma 2.5. Hence, assuming 
WF(a) and WF(S,(a1,-...,%,,2 # 1,1, a)), we conclude | nWF(g(n)) 
with the help of A x(WF(x) > WFS,(a1,..., %,.Z, X,0)) proved above. 
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But 
See one eee 1 te 2 1) — in gn) 


by lemma 2.5. Hence WF(S,(«,, ..., %> Z4#1, 1, a#1)). Thus (5) is proven. 


CorOLLary 2. Q,(0, 1). 

Proor. If n> 1, then Q,(0,1) is Az(Q,-,(z,1) > Q,-1(2 # 1,.1)), 
so Q,(0,1) from corollary 1. We must prove Q,(0,1) separately. This 
proof is similar to the last part of the proof of corollary 1, a function g 
being constructed by use of lemma 2.1] and an appeal being made to lemma 
2.6 (for S,; and P,). 


We define Q, for all n < w inductively as follows: 
Qo = Q, 
One. = ae 


Coro.iary 3. For all k, WF([{Q,, 0]). 

Proor. We first prove (/\\b){WF(b) > WF([0, b])}. Assume WF([0, a]) 
for alla < b. Let f generate a descending chain, where f(0) < [0, b]. Then 
(0) = k[0, a,] + B where k is an integer, a, < b and B < [0, a,]. Since 
f < [0, a,] we can apply the hypothesis of transfinite induction and conclude 
that there is an n such that f(0) >... > f(n) = j[0, a,] # y, where j < k 
and y < [0,a,]. After finitely many applications of this argument we get 
p such that f(p) = 6, where 6 < [0, a,], so we can apply the hypothesis of 
transfinite induction once more to get g such that — [f/(p) > f(q+1)]. 
iMowsero, — ee — e— 2 — in corollary 2 and get 


20 a1 
A b{WF(b) > WF([Q——, b])} > Ab{WF(b) > WF([Q——, b))}. 


Hence from (A b{WF(b) > WF((0, b])} and Q,(0) and ... and Q,(0) we 
get 
2 


Coro.iary 4. Let T* = {a:aeTAa < Q}. Any proper segment of T* is 
well-ordered. 
Proor. From corollary 3. 
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From theorem 1.1 we see that the relation < is decidable. Thus we have 
proven the well-ordering of any proper segment of our notations less than 
Q by using TIp. 

Simon Fraser University 
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REGULAR ORDINALS AND NORMAL FORMS 


DAVID ISLES 


1. Introduction. The ordinary Cantor normal form for countable ordinals 
§ employs the ordinal functions of addition, multiplication and exponentia- 
tion (based on q) to express 0 in the form w"'n,+ ... +@*n,; if 0 is nota 
fixed point of the normal function Axw*, the ordinals 0;, n; are all < 0. 
In as much as the fixed points of any normal function f are themselves the 
range of a normal function (called the derivative of f; see theorem 2.1), one 
can define a sequence of normal functions {f,} where f enumerates the fixed 
points of f, for « < f. This sequence can be used to obtain normal forms 
for an initial segment of the second number class in which many of the 
fixed points of Axw* are included. The length of this segment depends on 
the size of the collection {f,}. Starting with work of Veblen [7], Bachmann 
in [1] showed how to use normal functions of the third number class in de- 
fining the set {f,}; this procedure was later continued by Gerber [4] several 
of whose ideas occur here. In this paper, the process is generalized to allow 
the use of the wth number class for certain transfinite «. 

Let G = G, be the normal function which enumerates the closure under 
limits of the class of regular ordinals (def. 2.2). In section 3 we employ the 
sequence {G,}, where G, enumerates the fixed points of G, for « < f, in 
defining normal forms for all regular ordinals 6 < G;(1) where J is the first 
regular fixed point of G. Normal forms for ordinals a, G(p) < a < G(p+1) 
are obtained by constructing sets of ordinals Or(p) © G;(1)+1 and then 
generating the derivatives o, of the function AxG(p)* (exponentiation) for 
a € Or(p). In section 6 we define the sets Or(p) by induction on a < G;,(1): 
if « < G(p+1), then « € Or(p); if G(p+1) < & and «e€ Or(q) (p+1 < q) 
then, roughly speaking, « € Or(p) only if it is possible to define a normal 
form for « in which all of the ordinals occurring are < G(p+1). Details of 
the procedure whereby a normal form for « € Or(q) can be constructed are 
given in sections 4, 5 and 7. 

We conclude this paper first by demonstrating that the normal forms de- 
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veloped include those of Bachmann and yield a recursive ordering for a seg- 
ment of the countable ordinals, and then by indicating how they can be 
used to replace ordinal diagrams in some of Takeuti’s work. Most proofs are 
omitted. 


2. Bachmann collections of ordinals. In what follows we will consider an 
ordinal number to be the set of its predecessors. A normal function is an 
increasing continuous ordinal function; 0 will not in general be included 
in the domain of the normal functions discussed here. If fis any ordinal func- 
tion and @ an ordinal, Vf will denote the range of f, f| 6 the restriction of f 
to 0, Suc(@) that 6 is a successor, and Lim(6) that 6 is a limit. 


DEFINITION 2.1. A limit ordinal @ is regular if there is no 6 < é and function 
f: 9 + @ such that @ = lub {f(a)+1|« < 6}. 


To secure the existence of regular ordinals we add to the usual axioms of 
set theory (not including the axiom of choice) the following axiom. 


AXxNorm. Every normal function defined for all ordinals has arbitrarily 
great regular fixed points. 


Applying it to the identity function, we obtain an enumeration of the clo- 
sure under limits of the regular ordinals. 


DEFINITION 2.2. G(1) = o. 

G(@+1) = least regular ordinal greater than G(6). 

Lim(2) > G(u) = lub{G(6)|@ < pz}. 
In what follows up through theorem 2.5 we will suppose that fis a normal 
function on the regular ordinal 0. 


DEFINITION 2.3. The mth iterations of f are the normal functions f/”(x) 


where f(x) =~ andy (aed) 


THEOREM 2.1 (Veblen [7]). The set of fixed points of f is order isomorphic 
to 0. The order isomorphism f’ is a normal function on 0 (called the derivative 


of f). 
THEOREM 2.2. (1). f’(1) = lub {f"(1)|1 < n < a}, 
(2). f’(x+1) = lub {f"U'(x) +1) < 2 < o}. 
Both of the given sequences are fundamental. 
Let {fell < 0<yp, Lim(u), up < 0} be a sequence of normal functions on 0. 
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THEOREM 2.3. (Veblen [7]). Suppose that if 1 < 0 < yu then Vf, S Vf, for 
1 < v < 0. Then () {Vfol 1 < 0 < pt} is order isomorphic to 6 and the order 
isomorphism @ is a normal function on 0. 


THEOREM 2.4, (notation from theorem 2.3). Let 


= lean 8 ag. Ne << 
Then 
p(1) = lub{fig (11 < i < x} 
and 


p(x+1) = lub{fia (@(~)+1)|1 < i < a}. 
All of the given sequences are fundamental. 
{foll < @ < 0} is a sequence of normal functions on @. 


THEOREM 2.5 (Veblen [7]). Jf for every 0,1 < 0 < 4, 
Gime; = Vi, sv <0 

and 
Ovecnn(e) Vin 1 Vile 0}, 

then the function Axf,(1) is a normal function on 0. 


The sets Or(p) mentioned in the introduction will be unions of sets of the 
following sort: 


DEFINITION 2.4. Suppose Lim(). A Bachmann collection of ordinals of 
type < «x and bounded by z, 8(z, 11) is a set of ordinals such that: 
Bl. 0¢ B(x, ny) > 0 < a, 
B2. teB(z,n);0<n => OE B(x, yn) iff 0+1e Ba, pw). 
B3. If 6¢B(z, w) and Lim(@), there is defined a fundamental sequence 
DS[@] (the ‘distinguished sequence’ of @) such that: 
(a). Domain of DS[@] = T(@) < pu. 
(b). DS[@](i) ¢ B(z, w), 1 < i < T(6); 6 = lub in B(a, w){DS[O](/}. 
(c). If 1 <d <1(0), d= d'+1 and Suc(DS[0(d)), then 
DS[0](d) = DS[0](d’)+1. 
(d). Lim(e) and e < T(@) > Lim(DS[@](e)), T(DS[@](e)) = e and 
DS[DS[6](e)] = DS[alfe, 
(e). (‘Nesting property’). If Lim(6), 6 € B(z, ) and 
DS[0](i) < 6 < DS[0](i+1), then DS[0\(i) < DS[5](1). 
DS[@}(i) will often be abbreviated 6(i). 


If {f,|1 < « < 6} is a set of ordinal functions then if B<a< 0, Boo 
means Vf, S Vf;, B<a< 4. 
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THEOREM 2.6 (Bachmann [1]). Let F be a normal function and 0 a regular 
fixed point of F. Given 8(x, 0) there is a set of normal functions {F(0, —)| 
0 € B(x, 6)} such that: 

(1). Domain of F(0, —) = @; 

(2) sl = alae 

(3). F(O+1, —) = F(0, -Y; 

(4). if Lim(6), T(@) < 6, then VF(6, —) = (\ {VF(DS[@](i), —)I1 <i 

< T(9)}; 

(5). if Lim(@), T(@) = @, then F(@, —) = [AiF(DS[6](i), 1]. 
Furthermore, in cases 4 and 5 

(6). DS[0](¢@)+1 + DS[@](d+1). 

Proor. See Bachmann [1], p. 130. 


The family {F(0, —)|0 ¢ B(x, d)} will be said to be ‘based on’ B(z, 6). 


LEMMA 2.7 Suppose 1 € B(x, 0) and 1 < F(1). If ae VF(I, —) there is a 
last B* € B(x, 0) such that ae VF(B*, —). Ifa < F(x+1,1), thena = F(f*, 
b) where b < a. 

Proof. See Bachmann [1], Satz 1, p. 138. 


This lemma will be basic in our construction of normal forms. Although the 
normal forms will be defined in terms of normal functions based on Bach- 
mann collections, they will be expressible as finite ordered sets of ordinals. 
In order to develop these normal forms, we will consider Bachmann collec- 
tions with each member @ of which will be associated finite sets of ordinals 
called the ‘parts’ of 0. These finite sets will include the ordinals occurring 
in the normal form of 0. 

Notation. If Q is a set of ordinals and @ an ordinal, then Q < @ (or 
Q < 6) is an abbreviation for (x)[xeQ=>x < 0 (orx < #)J) andd< Q 
(or 6 < Q) abbreviates (Ex)[x € Q and 0 < x (or @ < x)]. 


DEFINITION 2.5. A Bachmann notational collection of type < 0, bounded by 
nm, 6*(x, 0)isa Bachmann collection with each member of which is associated 
two finite sets of ordinals P[9] and ND[@] (‘parts’ of 6 and ‘non-distin- 
guished parts’ of 0) and an ordinal D[6](‘distinguished part’ of @) such that: 
Pl. xe P[0] U ND[@]U{D[6]} = x e B*(x, 0), x < 0,x < dand,ifw <x, 
x is unequal to any regular ordinal. 

P22) P(e] = ND[6]U{D[6}}; 

(b). P[P[0]] = U {Pkxllxe P[A}} = PI6l, 
D[D[@]] = D[6@]. 
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et ¢ be a limit,e < (6). 
P3. ND[DS[9](e)] = ND[6]. 
P4. D[O(e)]e ND[O(e+k), 1 <k <a. 
PS (a). P{@(1)] = ND[6}; 
(b). P[@(d+)}] = P[@(d)], where d = 1 or Lim(d) and 0< k < o. 
P6. If 0(e+i)<o< O(e+i+1) and ge B*(z, 0), then D[@(e)}] e ND[co], 
Nee 7 gant 


Notation: If we wish to emphasize that B(x, 0) (or B*(x, 0)) isa segment we 
will write B(x, 0) (or B*(x, @)). In the course of this paper several different 
Bachmann notational collections will occur; the associated parts and sequen- 
ces will be variously indicated by P[I; @], P[q; 0], P*[< q; 8], DS* [< q; 0] 
etc. 


3. Normal forms for regular. ordinals 
DEFINITION 3.1. J is the first regular fixed point of G. 


The existence of J is guaranteed by AxNorm. Notice that if the axiom of 
choice is assumed, / is just the first weakly inaccessible ordinal. {6{1 < @ < 7} 
is a trivial example of a Bachmann collection of type < J, bounded by J (let 
DS[6] = Ai[i < 6]). Thus we can apply theorem 2.6 to B(/, 7) and obtain 
{G(6, —)|1 < 6 < J} where G(1, —) = GPI. 


DEFINITION 3.2. NF(/, 6), the Cantor normal form (relative to 
{G(0, —)|1 < @< 7}) for ordinals 0, 1 < 0 < G([+1, 1). 
(1). If 0 = 6’+k where Lim(6’), 1 <k <a, 


NF(; 6) = NF(/; 6’)+1+... +1, (& ones). 


(2). If G(p) < 0 < G(p+1), NF(/; 0) = <0), Lim(6). 
(3). If @¢ VG(1, —), B* is the ordinal given by lemma 2.7 and 
GG .))sithen, NEU; @) = ¢8*, b>. 


Notation. NF(I; 6) = G(B*, b) will mean NF(/; 6) = <6*, 5>. Thus 
60 = G(BT, b,) = G(B}, b2) is possible although NF(/; 0) = G(T, 6;) 
and NF(/; 0) # G(B%, 52). 

By defining suitable distinguished sequences and parts, we can show that 
{00 < Gl, p+1) < G(Z, 1)} is a Bachmann notational segment of type 
< G(l,p+1), bounded by G(1, p+1) ie. 


{010 < G(I,p+1)} = B*(G(I, p+1), G(I, p+1)). 
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J-distinguished sequences and /-types, DS[/; 6] and T(J; 6), can be de- 
fined inductively for all 6 < G(J+1, 1) by considering NF(I; 6). Except for 
the cases G(p) < 0 < G(p+1) and NF(/; 6) = GU, 1) the cases of the 
definition are almost identical to those of def. 4.5 and will be omitted. If 
G(p) < @< G(p+1) and 6 = G(p)+80’, we set DS[J; 6] = Ai[G(p) +i] 
and T(/; 0) = 6’;if @ = G(J, 1) weset TU; 0) =m and DS[J; 6] = Ans(n) where 
s(1) = GC, 1) and s(n+1) = G(s(n), 1). The definition of J-parts, P[7; 6], 
ND[J/; 0] and D[J; 6], coincides with def. 4.6 except where G(p) < 0 < 
< G(p+1) or 0 = GU, 1). If 6 = G(p)+ 8 < G(p+1) we let P[/; 6] = 
= Pll; G(p) lute} NDI 6) — PU; Gp) and Dif, 0] — 2. lt 0 Gia 
then P[J; 86] = ND[J; 0] = {1, w} and D[/; 6] = o. If 6 = G(p+1) then 
PZ 0] = ND[Z; 6) — PIE G(p))] and D6) — a. 

By induction on {6[@8 < G(/+1, 1)} while considering cases arising from 
NF(; @) one can establish properties B3(d), B3(e) and P1-P6 (def. 2.5). 
In the proof of B3(e) and P6 one makes use of the recursion relations: 

G(s,u) = G(s), u') iff (aes re ad eas ee) 

Or (D) sss anca cars 
OF (Cas = 5 and(G(s, 1) << 17. 

The following properties of J-parts (needed later) can also be derived: 

P2(c). PU; NDZ; 0) ND (ee; 

ND[/; Di; 6]] © ND[J: 6]. 
P7(a) > Suc( 0) ieee | ale 

(b)(i). DZ; 0] = w > TZ; 0) = @ or TC; 8) = G(g+1). 
(ii). DZ; 6] 4 w = (E!q)[G(q) < D[; 4] < G(qg4+1) and 
Di; 6] = G(q)+TU(; @)]. 

In both cases if Lim(e), e < T(J; #), then D[/;6(e)] = G(q)+e. 
P8. If either D[Z; 0] = @ and T(J; 0) = G(¢g+1), or D[J; 6] = G(q)+ 
TU; 6), then Pi EG = NDI ap 
Lemma 3.1 guarantees that in certain circumstances the distinguished se- 
quence of an ordinal can be extended. This ‘extension property’ will be pos- 
sessed by the Bachmann collections which we discuss in section 4. If a and 
b are ordinals, a < b, [a,b] = {ela < e < db}. 


LemMA 3.1. Suppose Lim(@), 0 <6 < GU/+1,1). Let d=T(U; 6) = 
D[J; 0] and assume G(q) < d < d* < G(q+1) where d* < Band d* <TU(I; B) 
if @ = DS[J; B)(d). Then if ND[J; 0JUND[J; B]n[d, d*) = 9, there is an 
ordinal 0*, 0* < B such that TI; 0*) = d*, DS[J; 6] = DS[J; 6*]fd and 
9 = DS[J; 6*](d). 
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Proor. We use a double induction on f£ and @. Notice that if we establish 
the existence of a suitable 0*, then 0* < f. For if 0 = 0*(d) < B < 0* we 
could conclude that either 6 = 0*(q), Lim (q), d < g = TU; B) < d*, or 
D[J; 6*(k)]e ND[J; 8], where d < k < d*. Since D[J; 0*(d)] = G(qg)+ 
TCU; 0) = d, it follows D[J; 6*(k)] = G(q)+k = k. Both cases are impossi- 
ble. 

We may assume 0 # f(d) for if it did simply let 0* = f if d* = T(J; B) 
or 0* = B(d*) if d* < TU; B). If G(g) < 8 < G(q+1) we would have 
@ = dand could take 0* = d* < G(g+1). There are two cases left to con- 
sider: NFU; 0) = G(y, ») and NFU; 0) = G(u,b+1), where Lim(y). In 
both cases xp < @ and ND[J; np] © NDJ[J; 8] implies the existence of p* 
with DS[J; »] = DS[J; w*]{d. In the first case, we let 0* = G(y, p*) after 
observing that x and y* must lie between the same consecutive fixed points 
of G(y, —). In the second case, we let 0* = G(u*, 1) if b = 0. 

b < G(p, b) is impossible for that would mean d= D[J; np] e PLZ; p] 
© P[J; G(p, b)] © ND[J; 0] (def. 4.6(6)). The remaining possibility is 
G(u, b) = b; hence NF(I; 5) = G(z,e) for w<a. If p< n= yp", let 
C2 —aGieere--)) and ih f <u" < _ let 0 = Gin b+1) p< an <p 
is impossible since this would imply veP{/; xz] S P[/; 5] Ss ND[/; 0] 
where Lim(v), d < v < d*. 

Replacing NF(J; 0) by NF* (J; @) in def. 3.2 and changing clause 3 to read 
NF*(/; 0) = <NF*(J; 6*), NF* (J; 5)> we can define the extended Cantor 
normal form of @ (relative to {G(B, —)}). 

Remark 1. If SU; 6) is the set of ordinals which occur in NF*(J; 8), 
dem SCG) = PR AEs 


4. Properties of Bachmann notational collections. Any ordinal @ lying 
between consecutive values of G has no interesting normal form NF* (J, 6). 
In this paper we will show how a large number of such ordinals < G(J, 1) 
can be assigned a normal form in terms of normal functions and smaller 
ordinals. In section 6 Bachmann notational collections of type < G(p+1) 
will be defined for each p+1 < G(J, 1); these collections will then be used 
as in theorem 2.6 to derive the set of normal functions which enter into the 
normal forms. This section and the next will be spent establishing those pro- 
perties of Bachmann collections which will be needed in their definition. 
We will use the abbreviation &(z, p) to stand for the Bachmann collection 
B(x, G(p+1)) where x < GU, 1) and G(p+1) < G(/,1); distinguished 
sequences and types of its members will be denoted by DS[p; @] and 
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T(p; @). In addition to B1-B3 we will assume that the following are true of 
See): 
B4. If 6 < G(p) and 0€B(a, p), then {ala < 0} & B(x, p). 
BS. If 06€B(z,p) and 0 < G(p+1), then DS[p; 0] = DS[J; 6] and 
WGI) y=: INGE), 
If the Bachmann collection 8(z, p) is, in fact, a Bachmann notational 
collection 6*(z, p), the parts of 0¢%*(z, p) will be denoted by P[p; 0], 
ND[p; 6] and D[p; 6]. In addition to P1-P6 (def. 2.5) and P7, P8 (with p 
replacing J), the following is assumed to hold for 8*(z, p): 
P9. If 6e B*(z, p) and 6 < G(p+1), then P[p; 6] = P[/; 0], ND[p; 6] = 
ND[J; 6] and D[p; 6] = DIJ; 6]. 

Given &*(x, p) we employ theorem 2.6 to generate {F?(0; —)|0 € B*(z, p)} 
where F?(1, —) = AxG(pY'}G(p +1). 

We first establish that recursion relations similar to those for {G(B, —)} 
hold for {F?(6, —)}. This is accomplished by a series of lemmas of which 
the most important is 


Lemma 4.1. If 0 < y and either 

(1). a = | and ND[p; 6] < F470 2c) = Dip; 2] 
or 

(2). P[p; 0] < F'(y, ¢) =a, 
then Fal Gsa)) — he (a): 


THEOREM 4.2 (Recursion relations; Gerber [4]). F?(6, a) < F?(y, c) iff one 
of the following conditions holds: 
(1). 0 < y and P[p; 6]U {a} < F?(y, c), 
(2). 0 =sy and aia. 
(3). y = 0 and F"(@,a) =< Pip ee, 
(4). » < 6 and (Ex)[{(x € P[p; yland1 < c)or (x € ND[p; y]and1 = c)} 
and x = F"(6; a): 


COROLLARY 4.2.1. F?(6,a) = F?(y, c) iff one of the following conditions 
holds: 
(1). 6 < y and either 
(a). 1 =< ater Ge mand Pipsele= s2 (nc): 
or 
(bj 1 =a, Dip; Gl 8e Gro) and Dp seller "(y2e), 
(2). Reverse roles of 0 and y, a and c in (1). 
Gl @ = yond a =r 
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DEFINITION 4.1. Suppose 2¢€8*(z,p). We define by induction on 
a < G(p+1) the Cantor normal form of a relative to 
{F*(0, —)|0 €B*(zx, p)}, NEF(x, p; a): 

(yt @ <1G(p SNE (pa) — NEV a) 

(2). Ifae VF7(1, —) let « be the last member of 8*(z, p) such that 
ae WF*(a, —); a= F(a, b). Then NF(z, p; a) = <NF(/; G(p))> <a, b>. 
(This will often be abbreviated as NF(z, p; a) = F?(a, b)). 

(3). If G(p) < a < G(p+1) and a¢ VF*(1, —) then a can be written 

to1 F?(a;,b;)m; +m, where m; < G(p) (and the summands are de- 
creasing). Then 


NF(q, p; a) = «<NF(x, p; F?(a,,5;))), 
<NF(J; m,)>+ ... + NFU; ms 1)>- 


Remark 2. If up < x, B*(u, p) is also a Bachmann notational collection. 
Suppose pe S*(u, p) and NF(u, p; a) = F?(0, b) where P[p; @]U{b} < a, 
by corollary 4.2.1 it then follows that NF(z, p; a) = F?(0, b). 

Using 8*(z, p) as we used {66 < J}, we now define the analogous set 
to B*(G(p+1), G(p+1)). 


DEFINITION 4.2. B*(z, p) is the set defined as follows: 
(mligreeG(p) then B-(. 7) = jala = 7); 
otherwise, 
(2). a < G(p) > ae B*(z, p); 
(3). if NF(z, p; a) = F?(0, 5), P[p; @]U{b} < a and 
P[p; 0]U{b} | B*(x, p), then ae B*(z, p); 
(4). if NF(x, p;a) = >} F?(6,, b;)m;+m then ae B*(z, p) if 
{F*(0,, b,)} © B*(x, ) 


Members of B*(z, p) will be indicated by a, b, c, etc. From remark 2 it fol- 
lows that B*(u, p) S B*(x,p) if w< 2 and pe B*(z, p). We will show 
B*(x, p) = B*(F?(x+1, 1), G(p)), a Bachmann notational segment of type 
< G(p), bounded by F?(x+1,1). The next theorem and its corollaries 
show that B*(z, p) is, in fact, a segment. 


DEFINITION 4.3. Suppose {6, 6} S B*(z, p) and 0 < B. Further suppose 
d = T(p; 0) = D[p; 0] and that d* is an ordinal such that G(q)<d< 
d* <G(q+1)<G(p+1) where d*<f and d* < T(p; B) if 6 = DS[p; B](d). 
Finally suppose (ND[p; 0J)UND[p; f])o [d, d*) = 9. If whenever these 
conditions hold there is an ordinal 6* € 8*(z, p), such that 0* < 8, 
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T(p; 6*) = d*, DS[p; 0) = DS[p; 0*\[d, and @ — DS[p; 6*|(@))etien 
%6*(z, p) is said to have the extension property (EP). 


THEOREM 4.3. If 8*(z, p) has EP and ae B*(x, p), then b <a implies 
be B*(x, p). ; 

PRooF. Suppose the theorem were false and let <a, b> be the smallest 
pair (lexicographically ordered) for which it is false. By using theorem 4.2 
and the definition of B*(z, p) one can show b = F?(0, 1) = DIp, @]. Next 
consider NF(z, p; a) = F°(0, c). Application of the recursion relations to 
b <a yields several possibilities. If 6 < 0 there must be x9 € P[p; 0]U {c} 
S B*(x, p) with b < x9 < a. No 6 = @ is impossible since b = D[p; 0] 
e P[p; 0] S B*(x, p). Hence 0 < @ and P[p; 0]U{c} < b. From the latter 
inequality follows ND[p; @]n[b, G(p+1)) = 9; since ND[p; 0] < 6 we 
have ND[p; é]a[b, G(p+1)) = 9 Suppose @ = DS[p; 6](b) where 
b < T(p; 9); if T(p; 0) # G(p+1) then, since 6 = F?(d,1), T(p; 0) = 
= D[p; 0je P[p; 6] < 5. Thus T(p; 6) = G(p+-1) which has as a consequen- 
ce that b = F?(6(b); 1) e VF?(@; —), ie. NF(x, p; 6) # F?(@, 1). The only 
option left is that 0 4 0(b). But then since 6*(z, p) has EP there is an ordinal 
a* < @ such that T(p; 0*) = G(p+1) and DS[p; 0*](6) = 0. This means 
b = F(0(b), 1)e VF?(d*, —) and contradicts the fact that NF(z, p; b) 
= F?(d, 1). 


CoroLiary 4.3.1. If 6*(z, p) has EP and P[p; 2] S B*(x, p), then 
B* (jp) = {ala = il) 


The next lemma and theorem will be used in the inductive definition of Bach- 
mann collections (section 6). Recall that if 6*(z, p) isa Bachmann notational 
collection, so is 8*(DS[p; z](i), p), and NF(DS[p; z](i), p; a) is the Cantor 
normal form of a relative to {F?(6, — )|@ e€ B*(x(i), p)}. 


LemMA 4.4. Consider DS[p;x](i) where 1<i< T(p;x) if T(p;2)< 
< G(p+1), and 1<i < F?(x,1) if T(p; x) = G(p+1). Then iff a< 
< Plp; x(i)] and NF(x(i), p; a) = F?(0, 1) = T(p; 0), it follows that 
NF(z, p; a) = F?(0, 1). 


DEFINITION 4.4. A,, (a), the ancestors of a (relative to 
{F?(B, —)|B € B*(x, p)}) for a < G(p+1): 

(1), If a < G(p), Ay, (a) = {a}. 

(2). If Plp; 0]U{b} < F?(6,b) = NF(z, p;a), then 


Ay, A4) = Ay Eps el) See Oe Pip cl one: 
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If F°(6, b) < Pip; 6]U{b}, A,, (2) = 9. 
(3). If NF(z, p; a) = >) F?(0;, b;)m,;+m then 


Ar, (4) = J Ax, F(8:, bi) U {my -. . m}. 
Remark 3. Notice that a € B*(z, p) iff A, ,(a) # 9 and A,, ,(a) S B*(z, p). 


THEOREM 4.5. Suppose 8*(x, p) has EP and P{p; x] S B*(n, p). Then 

(1). if T(p; x) < G(p +1), P[p; x@)] S B*(x(i), p), 1 < 7 < T(p; 2), and 
if T(p; x) = G(p+1), then P[p; x(i)] S B*(x(i), p) for 1 <i < F(x, 1); 

(2). if T(p; x) = G(p+1), then (0)[0 ¢ B*(x, p) and DS[p; n](F?(x, 1)) < 
0a bp; @ |) ea (G. Pp) 

Proor. (1). Suppose x9 € P[p; x(i)] and x, ¢ B(x(i), p). By remark 3 
there is some smallest ancestor a < xy such that a ¢ B*(x(i), p). Hence 
INE (1), p32) = F"(0,2) or NE(n(t), p; a) = F?(0, 1) = D[p; oe] =a, 
(corollary 4.2.1). In the first case a < P[p; x(i)] < F?(x(i), 1) > ae VF? 
(6+1, —). No. Using lemma 4.4 in the second case, one derives 
NF(z, p; a) = F?(0, 1) = D[p; @] ie. a¢ B*(x,p). By using properties 
P1-P7 (def. 2.5) one sees a < P[p; x(i)] < P[p; z]U{F?(z, 1)}. 

As P[p; x]U{F?(z, 1)} S B*(x, p) the contradiction ae B*(z, p) follows 
from theorem 4.3. 

Q\e NE GG, 1))) p; Fea, 1)) =F’ Gc(F? (x, 1)), 1) = F(x, 1) = 
D[p; x(F?(x, 1))] implies P[p; x(F?(x, 1))] & B*(x(F?(x, 1)), p). Suppose 
F(x, 1)¢ VF?(0, —) for some smallest 6, n(F?(x,1)) < 0 <2. Then 
T(p; 0) = G(p+1) and F?(n, 1) = F?(0(F?(x, 1)), 1). F(x, 1) € VF?(6(F? 
(x, 1)), —) implies 0(F?(x, 1)) < x(F?(x, 1)) < 8 < x. Application of the 
nesting property yields the contradiction 0(F?(x, 1)) < n(F?(x,1)) < @(1). 
Thus F?(z, 1)¢VF?(0, —) and hence NF (0, p; F?(x, 1)) = F?(x(F?(z, 1)), 
ID]: a FP, 1), eer (n, 1) ¢ B*(0, p) for n(F?(n, 1))< 0 < zx. 
But x(F?(x,1)) < @< 2 implies either 6 = x(e) where Lim(e) and 
F’(n,1)<e or n(e+k) < 0 < n(e+k+1), Lim(e) and F(x, 1) < e. In 
the first case e = D[p; 6] and in the latter ee ND[p; 6]. But this means 
P[p; 0] € B*(0, p), for as B*(xz, p) has EP, so does 8*(6,p) and thus 
(theorem 4.3) ee B*(0, p) would imply F?(z, 1) € B*(8, p). 


For the rest of this section we shall assume that 8*(z; p) has EP, neB*(z, p), 
and P[p; x] S B*(z, p). We first show that the segment B*(z, p) is a Bach- 
mann collection by defining sequences of length < G(p). 
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DEFINITION 4.5. We define DS*[< p;a] and T*(< p;a) inductively for 
ae B*(n, p): 


(1). 
(2). 


lf a< G(p), DS*|< 7; a) = DS[f, a] and 1*( <p: c) — ie 

(i). NF(z, p; a) = F?(0, b)m > DS*[< p; a] = F?(0, b)DS[Z; m], 
piatol Mae feo) = INE e))) 

(ii). NE(z, p; a) = F?(1,b+1) = DS*[< p; a]= F?(1, b)DS[Z; G(p)], 
and T*(< p; a) = T(J; G(p)). 


(iii). NF(x, p; a) = >) F?(0;, b;))m,;+ 6 => 


(3). 


(4). 


(5). 


(6). 


DS*[< p; a] = >) F?(6;, b;)m,;+DS*[< p; 4], 
and. 0*( =< pra) e(<<ino) 
If NF(a, p; a) = F?(0+1,64+1),0 < band 1 < @, then 
DS*[< p; a] = Ang(n), 
where 
g(1) = F°(0,2) if b = 0, 
g(1) = F?(0, F®(6+1, b)+1) if 0 < 4, 
g(n+1) = F?(0, g(n)), 
Bhatel (P< jo, Gi) = 
If NF(z, p; a) = F?(0, b) with Lim(d), let b = ft+e where f is the 
last fixed point of F?(6, —) preceding b, 0<e<b<a. As B*(x, p) 
is a segment, e € B*(z, p). Then 
DS*[< p; a] = F?(6, 1+f+DS*[< p; e]), 
and T*(< p;a) = T*(< p;e). 
NF(x, p; a) = F?(0, b+1) where 0 < 5b, Lim(6), and T(p; @) = 
= G(p+1), then 
DS*[< p: al = Anhi(n), 
where 
h(1) = F?(0(2), 1) if b = 0, 
h(1) = F?(0(F?(0, b)+1), 1) if 0 <8, 
h(n+1) = F?(0(A(n)), 1), 
and! © *(= pa) —aoy 
NF(a, p; a) = F?(0,b+1) where Lim(@) and T(p, 0) < G(p+1). 
If D[p; 0] =a (so T(p; 6) = @ or T(p; 0) = G(g+1)< G(p)) or if 
w # D[p; 0] < G(p), we let 
DS*[< p; a] = AiF?(0(1 +1), F?(0, b) +1), 
and T*(< pia) — Ue), 
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Else D[p; 0] = G(p)+T(p; 8) and T(p; 0)¢ B*(n,p). If 0 < 8, let 
DS*[< p; a] = F?((1+DS*[< p; T(p; 9)]), F?(8, b)+1) 
and 
T*(< p; a) = T*(< p; T(p; 9). 
If b =0 let f be the last fixed point of the normal function AiF?(O(i), 1) 


preceding T(p; 9), thus T(p; 6) = f+e where e < T(p; 0) and 
e€ B*(x, p). Then 


Tapa) = 1*(= pe). 
and 
DS*[< p; a] = F?(0(1+f+DS*[< p; e]), 1) 


Notation: DS[< p;a](i) will often be written a(i). Note that a = lub 
nee = TA pe) 


Lemmas 4.6 and 4.7 are used in establishing that NF(z, p; a(i)) = a(i), 
a fact needed in the proofs of theorems 4.8 and 4.9. 


Lemma 4.6. (1). Jf NF(z, p;a) = F(a, b), Lim(b), and b(1)<c< b, 
hentia <= F(a, c). 

(2). If NF(a, p;a) = F(a, 1), Lim(a), D[p;«] = G(p)+T(p; «), and 
Mpae\ <= U(p;a), then c= FP(a(c), 1). 


Lema 4.7. (1). [fa = F(a, B) and1 <b < F*(a+1, 1) or F¥(a+l, d)< 
<b < F°(a+1,d+1), thna¢ VF?(0, —) fora<O0< 7, 0E€B*(x, p). 

(2). If a = F*(a(e), 1) where 1 <e < T(p; a) and e < F?{a(e), 1), then 
a¢ VF*(60), —) fora<@< 1, 0€8*(n, p). 


Corotiary 4.7.1. If a¢ B*(x, p), NF(x, p; a) = F(a, b) and either 1 < « 
or 1 = « and Lim(b), then NF(z, p; a(i)) = a(i). 


We now use these lemmas to prove that properties B3(d) and B3(e) hold 
for B*(x,p) and hence that B*(x,p) is a Bachmann collection of type 


< G(p). 
THEOREM 4.8. Suppose a € B*(n, p) and e is a limit, e < T*(< p;a), then 
Lim(DS*[< p;a}(e)), T*(< p;a(e)) =e, and DS*[< p; a(e)] = DS* 


[< p; a}fe. 
Proor. By induction on 8*(z, p) using the preceding corollary. 
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THEOREM 4.9. (nesting theorem). Suppose a, d are limits, {a,d} SB*(z, p), 
and a(u) < d < a(u+1) where 1 < u < T*(< p; a); then a(u) < d(1). 

Proor. The structure of the proof is a double transfinite induction: 
the primary induction is on {a|Lim(a) and a € B*(z, p)} and the secondary 
induction on {dla(u) < d < a(u+1) and Lim(d)}. Occasionally a subsi- 
diary induction on u is needed. It is a consequence of theorem 4.2 that the 
inequalities a(u) < d < a(u+1) can arise only because of certain ordering 
relations between the p-parts and the arguments of the normal forms of 
a(u), d, and a(u+1). The proof of the theorem as one considers each of 
these several possibilities is tedious but not difficult. Full use is made of 
lemma 4.6, corollary 14.7.1 and of the properties P1—P9 for p-parts. 


To complete the task of demonstrating that B*(z, p) is a Bachmann nota- 
tional segment, we now define parts for a € B*(z, p) and show that analogues 
of P1-P9 hold for them. 


DEFINITION 4.6. We define inductively P*[< p;a], ND*[< p;a] and 
D* (<p: alior @e B(x, p): 
(1) (i). a=a' +1 > P*[< p; a] = ND*[< p; a] = P*[< psa’), 
and Dal< 7 al— 
(ii). a < G(p) => P*[< p;a] = PI; a], 
ND? (<97, 4) NDIZ.al 
and D*|< pial = D[i: al: 
If G(p) < a, P[J; G(p)] will always be included in P*[< p;a] and ND* 
[< p; a]. 
(2) (i). NF(a, p; a) = F°(0, b)m => 
P*[< p;a] = P*[< p; F?(8, b)JUP[; m], 
ND*[< p; a] = P*[< p; F?(0, b)JUND[Z; m], 
and D*[< p; a] = Dil; 7]. 
(ii). NF(x, p;a) = F?(1,b+1)=> 
P*[< p;a\= Pl pF o)| NDA pa] 
and D*[< p; a] = D[J; G(p)]. 
(iii). NF(x, p; a) = )) F?(0;, 6,)m,+a' > 
P*[< psa] = (J P*[< p; F?(8;, b;)m;JUP*[< p; a’), 
ND*[< p; a] = ) P*[< p; F?(6;, b;)m;JUND*[< p; a’] 
Dl 7.4) — la |: 
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(3). NF(z, p; a) = F°(6+1,64+1),1< 6=> 
P* |= p50) = ND* |< pra) — 
Bop Pp reer |-apee(C-ie.b))), 
Di) — 
(4). NF (zx, p; a) = F°(0, b) > b, Lim(b) > 
ea op, Pip; e)JUP*(< p; a), 
ND*[< p; a] = P*[< p; P[p; @]JUND*[< p; 6], 
D*[< p; a] = D*[< p; 5}. 
(5). NF(x, p; a) = F?(0, +1), Lim(@) and T(p; 0) = G(p+1) => 
Pe 2 cl eND 29) — 2 <p; P[p; @]JuP* (= p; 6], 
i) 2p a) a 
(6). NF(a, p; a) = F°(0, b+1), Lim(0) and T(p; 0) < G(p+1)=> 
Bal 2; 2) B= parle. jor <p, F°(0, 6)], 
ND*[< p; a] = 
ep, NDP; GND |< i9; Dip; OOP = p, P20, b))), 
Dep 7a\— D* |p; Dp, 0), 


Remark 4. Both this definition and def. 4.5 depend on NF(z, p; a).Remark 
2 shows that if ae B*(x,p) then NF(z, p;a) = NF(f,p;a) for x < B. 
Consequently DS*[< p; a], P*[< p; a], etc. are independent of the ordinal z. 
One can now show that properties PI-P9 are true for P*[< p; a], 
ND*[<p; a], DS*[< p; a] etc. The proofs of all of these except for P6 are 
straightforward; one inducts on ae B*(z, p), invoking case analyses accord- 
ing to the possibilities of NF(zx, p; a), and employs properties B1-B5 and 
P1-P9 for 0 € 8*(x, p). 


5. Redefining sequences and parts of members of 6*(z, p). We now show how 
sequences of length < G(p) and parts < G(p) can be defined for certain 
members of 8*(z, p). 

Assume $*(z, p) has EP and x € 8*(z, p). 


DEFINITION 5.1. Suppose P[p;z] & B*(z, p). 
(1). Bios P< p, Pip.) 
ND[< p; x] = P*[< p; ND[p; x]] U ND*[< p; D[p; 7]], 
and 
IIS (05 || Ds oy Dad al 
(2)(a). If D[p;x] = o, then T(p; 2) = or T(p; x) = G(q+1). 
If T(p;z) = wor G(g+1) < G(p), then 


DSi; 2) — DS[p: x] 
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and 
T(< p; x) = T(p; 7). 
If T(p; x) = G(p+1), then 
T(< p32) =e 
and 
DS[< p; 2] = AnL(n) 
where 
L(1) = Dsip. ale) 
L(n+1) = DS[p; 2](F?(L(n); 1)). 
(b). D[p; x] # w and D[p; 2] = G(q)+T(p; 2). 
If g < p, then 
T= ps) = Vp: a) 
and 
DS[< p: a) = DS|p- a): 
Hg — p, then 
T(< p; 2) = T*(< p; T(p; 2)) = T*(< p; Dip; 2]) 
and 
DS[< p; x] = DS[p, 2] 0 DS*[< p; T(p; 2)]. 
Theorem 4.5.1 says that if P[p;] < B*(x,p) then P[p; DS[< p; z](i)] 
S B*(DS[< p; ](i), p), hence 
DS[< p; DS[< p;z]{i)], PL< p; DS[< [p; 2]()], ete. 
are all defined. 

As with P*[< p;a], DS*[< p; a] etc., one now proves that properties 
P1-P9 hold for P[< p; 2], DS[< p; 7], etc.. The proofs are all straight- 
forward: one examines cases corresponding to those in def. 5.1 and uses 
the fact that properties B and P hold for P[p; 2], DS[p; x], P*[< p; a], 
DS*[< p; a], etc.. The next lemmas are established in the same way; from 
them properties B3(d) and B3(e) follow. 

LemMa 5.1. Suppose Lim(t), t < T(< p; 2), then DS[< p; DS[< p; z](t)]= 
= DS[< p;2][t and T(< p; DSipsal@)) == 
LemMA 5.2. Suppose 0€%8*(n,p) and P[p; 0] S B*(x, 0), then if 
DS[< p; m](c) < 0 < DS[< p; n](c +1), 

DS[< p; 2](c) < DS[p; 0](1). 
The final result of this section states that a uniform bound can be placed 
on the size of < p-parts of DS[< p; 2](i). This will be used in the inductive 
definition of the next section. 


Lema 5.3. P[< p; DS[< p; 2](i)] < Pl< p; n]U{T(< p; n)}. 
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6. Defining a sequence of Bachmann notational collections 


THEOREM 6.1. For each x,1 <x < G(J/,1), there is a collection of sets 
{B*(n, k)|l <k < GC, 1)} and, if x is a limit, a finite sequence of ordinals 
Y (2) = (415+ + +s Imm» 1 < m(x) < @. The collection and sequence have 
the following properties: 


(1). B*(z,k) S G(I, 1) and 8*(x, k) is a Bachmann notational collection 
with EP of type < G(k+1), bounded by nx. 
(2). B*(1, k) = {1}, P[k; 1] = ND[k; 1] = {1} and D[k; 1] = 1. 
(3). x ¢ B*(z, kK) > B*(x+1, k) = B*(z, k), 
mE O*(n, k) > B*(x+1,k) = B* (a, k)U {a+ 1}, 
P[k; x+1] = ND[k;z+1] = P[k; z] and D[k; x+1] = 1. 
(4). Suppose Lim(z) and G(p) < x < G(p+1): 
(0d, — 4n-1 <== 4a = 8 
(ii). Pg; 7] S B*(x, q;) and Ggi+1) < PL< qi mIV{T(< 9:5 2)} < 
< G(gi4,+1), 1 <i< m-1. 
(iti). Plq,3 7] & B*(%; Gm) OF Gm = 1. 
(b)(i). K < dm > B*(a, k) = oc B*(9, k); 
din < k= B¥(n, k) = YpceB*(0, KU {a}. 
Ca pee OU) — Pie) = PU a7)! ND[k: x)= NDI 7], 
D[k; 2] = Disa], Tk; 2) = T(I; 2), and DS[k; x] = DSU; z). 
Cie) ganda) = mi — 1 = Pl js n| = Pl< q;; x); 
ND[j; z] = ND[< q;; 2], DL; x] = D[< 93; 7], 
T(j; 2) = T(< qj; 2) and DS[j; x] = DS[< q;; a]. 
(5). 2 € B*(x, k) only as given by (2), (3) or (4). 


Proor. By induction on z, 1 < x < G(J,1), we shall define the collec- 
tion {B*(x, k)}. At the zth step in the induction (and only at this step) 
7 will or will not be put into the various 8*(z, k) depending on whether one 
of the conditions (2), (3) or (4) is satisfied or not. 

Suppose the sets {6*(0, k)|1 < k < G(J, 1)} have been defined for 0 < z. 

When z = 1 or z = x’ +1 the definition of 8*(z, k) is given by (2) and 
(3). Suppose Lim(z) and G(p) < x < G(p+1). We now proceed to define 
the sequence )’ (x) = <q,,.--,4m_> and to prove that the sets 


hee De(Gaein: = OX(n,k), Gn = ks 
are Bachmann notational collections with EP. 
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Set q, = p. By induction on 6 < z one sees that if G(r) < 0 < G(r+1) 
and r < k, then @€%6*(0,k). Hence 8*(x, k) = {0|0 < x} for gy =a 
If one defines parts, types, and sequences as in (4bii), it follows from the re- 
sults of section 3 that 6*(z, k) is a Bachmann notational collection with 
EP of type < G(k+1), bounded by z. 

Suppose qg,,...,4,, and 8*(z, k) have been defined and shown to have 
the appropriate properties. Assume that P[g,; 2] € B*(g,; 2) and that 9,4, 
is that unique ordinal for which G(q,4,) < PL< 9,32] U{T(< q,;7)} < 
< G(q:+,+1). Several lemmas of which the three below are most im- 
portant, combined with the results of section 6 suffice to prove that if parts 
and sequences are defined as in (4biii), 8*(z, k) is a Bachmann notational 
collection with EP (of type < G(k+1), bounded by z) for 9,4; < k < q. 
To this end recall from theorem 4.5 that if v = DS[< q¢,;2](i) 1 <i<T 
(< 9,3 z)), then P[q,;v] S B*(v, g,). Hence Pi< g,;v], DS|< g,, 0], ees 
are all defined. 


LEMMA 6.1.1. Let 9,4, < k < q,, then 
(a). ve B*(z, k). 
(b). P[k; 0] = Pl< q,; 0], 
ND[k; vo] = ND[< q,; 0], 
and D[k; v] = D[< 4,; v}. 
(c). If Lim(e), e < T(< q,;2) and v = DS[k; x](e) = DS[< q,; z](e), 
then T(k; v) = e and DS[k; v] = DSIk; z]fe. 


LEMMA 6.1.2. 2 = lub in 8*(x, k){DS[< 9,3 2](I1 < i < T(< 4; 7)}. 


LemMaA 6.1.3. 6*(z, k) has EP for. g,41<k < q. . 

Proor. See def. 4.3 for notation. If B < x, then {0, 8} © B*(f, k); by 
induction, 6*(f, k) has EP. Thus we can assume 0 < f = zx. Further note 
that as a consequence of lemma 3.1, z, 6 can both be taken > G(k+1). 
If Lim(z), then 6 < DS[k;z](d) for some d < T(k;z). The proof is 
completed by using the induction hypothesis to show that an appropriate 
6 € B*(DS[k, z](d), k). 


The definition of the sequence q, > gq. >... > 4,4, can be continued, 
but after a finite number of steps either q,, = 1 or P[q,,3 7] & B*(z, q,,)- 
Bork <q,,.set 3" (7, ) —"\)aees (aye) 


Clearly {) {8*(0,k)|0 < GZ, 1)} is a Bachmann notational collection 
with EP for each k, 1 < k < G(J, 1). The collection 
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Or(k) = U (B*(6, HIG < GU, D}U{G, Y} 

is also a Bachmann notational collection with EP if we set DS[k; Ge 1))— 
= Ans(n) (see vsectionme) ands P[k. Gi!) — ND[L GU. 1)) — {leo 
D[k: G(Z, 1)] = @. 


7. Use of the sets Or(k) to define normal forms 


Derinimone le lem Gil) aud Sia) —<gy..., 9.>; G(g,) <n < 
G(q,+1). For each t, g, < t, we will define PNF;(z) (the ¢th level pred- 
icative normal form of z) by induction on x. PNF'(z) will be an ordered set 
of ordinals and the ordinals in it form the set S(t; z). We will show that 
Sea le oe: 
(0). PNF'(G(Z, 1)) = </,1) for all t, 1 < ¢ < G(41). 
(rege 1 G(/, 1), then 
PINE (Gc) = NE Ue); 
SU; 2) S PI; zJu{T} 
(see remark 1). 
(2). We can assume PNF' is defined for g, < t and that 


Sn ie) Ses eee 
By the main induction if @ € Or(q;) and 6 < x, then PNF*(6) is defined and 
S(qi3 8) S Plays OJU{Z}. 
Suppose P[q;; 7] S B*(z, qi). 

(2.1). For each ae B*(x, q;) we will define (inductively) NF*(z, 9,3 4); 
this will be an ordered set of ordinals and the ordinals in NF* (x, g;; a) will 
form the set S(q;; a). It will be shown that S(q,;; a) S P*[< q;; aJU{T}: 

(a). Ifa < G(q;), 
NESGrg,.c) — NE (@. a), 
and 
S(q;; 4) = SU; a) S Pl; a] vu {I}. 
(b). Suppose G(g;) < a < G(q;+1) and NF(z, g;; a) = F°(0, 6) > b; 
by induction NF* (x, q;; b) is defined and 


S@;; 6) S P*[< 4,3 BJU {T}. 
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(i). 6 < a, NF*(z, q;; 8) is defined and S(q,; 0) S P*[< q;; bJU{T}. 
Then 
NF* (a, q3. a) = <<NF*(L; G(q;))>, <NF* (a, 93; 0), NF* (a, 935 5)>>, 
S(q:3 4) = SU; G(qi))US(qis 9)US(4:; 5) 
S PL; G(q,)JUP*L< 94; OIUP*[< 935 blu} 
= P(g elo ne 
(i) a <= @) Wet PNE"(0)— 20 eee o,) where 
S(qi; 9)— {7} = {b1,..., bg S Pla; 1. 
As P[q;; 9] <a we have {b,,...,5,} S B*(x,q,) and NF*(z, 
qi; b;) is defined with 


S(91;5;) S P*L< 935 bj). 
If we let NF*(z, q;; J) = I then 
INFO (4); 4) = 
<CNF* (I: G(q;))>, <R(NE*G:, 97 07))-. NE Gu g.50,)), NE (a ¢;, ye 
S(qi; a) = SU; G(g;))US(qi; b1)U . .. US(qi; 5) 
= Pls G(@)lOP* (<9; 2 joe El glo 
= P*[< 93 alu {I}. 
(2.2). For gi41 < t <q, PNF(z) is defined as follows: 
(a) i p=, nes € Ban, oh), 
PNE (z= NE (a, 
Clearly 
S(t; a) S PY gale) — Blin. 
(b). Otherwise let PNF"(z) = v(a,,...,4,), where 
S(qi3 n)— {I} = {a,, ee ey An} = Plais 7]. 
Then 
PNF' tz) = o(NEF (Gig, 27 eee cea ae) 
and 


S(t; 2) = Sq a,)U... VSG a,)U{T} S 
S P*|< gpa, JU eP* (<9 4.\0 = 
= Pi< ger tn— P[e noir 


for gi41 St < qj. 
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Remark 5. Notice that if {0,2} S Or(p), 0 < nandae B*(0, p) & B*(z, p) 
then NF*(6, p;a) = NF*(a, p;a). This is a consequence of the fact, 
(remark 2) that NF(0, p; a) = NF(z, p; a). 


DEFINITION 7.2. Let x < G(J, 1) and X(x) = <q,,..-, g,>. The predicative 
normal form of x (relative to {Or(k)|1 < k < GZ, 1)}), PNF(z) = 
= PNF*(z). 


Note that B*(G(Z, 1), p) = {ala < F?(G(UZ, 1)+1, 1)} for 1 < p < GZ, 1). 


THEOREM 7.1. If ae B*(G(I, 1), p)U{F?9(G(Z, 1)+1,1)} then a can be 
written uniquely as an expression made up from (I, <, >, >, 1, +}U{e|Lim(e) 
and e < G(p)}. This will be called NF(p; a). 

Proor. If a < G(p), NF(p; a) = NF*(J; a). 


NF(p; F°(G(L, 1) +1, 1)) = «<NF(p; G(p))>, <<L.1> +1, >. 
Otherwise suppose 
NF(G(/, 1), p; a) = F?(0, b), 
PINT 200) = (Opera Oy) 
where 
S(p; 0)— {7} = {b,,...,,} <a. 
Then 


NF(p; a) = <<NF(p; G(p))>, <A(NF(p; 5;), ..-, NF(p; 5,)), NF(p; 5)>>. 


One consequence of the recursion relations and the method of definition 
of the sets 8*(x, p) is that the ordering F'(«, a) < F’(f, b) is recursive for 
countable ordinals < F'(G(J, 1)+1,1). This is a special case of a more gene- 
ral result: The ordering F?(«, a) < F?(B,b) can be effectively determined 
relative to the order relations on a finite set of limits < G(p). To get this 
one shows that if 0, x € 8*(z, p) then the relation 0 < x can be effectively 
determined from the order relations of P(p; 2) UP[p; 6] (and the order re- 
lations of G). 

In concluding this section, we would like to compare our countable limit 
F'(G(Z, 1)+1, 1) with that of Bachmann, 


Pro,esaT) 
In our notation F,,.,(1) = F?(G(1, 3)+1, 1) ¢8*(G(Z, 1), 1) and 


Pre,+1)l) = eG, Sabi, 1), 1). 
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From the recursion relations 
P(r (Gi, 3)+1, By l= Ee a(GC. 1)+1, i): 


In the same way, the limit one obtains by generalizing Bachmann’s proce- 
dure to make use of Q,, n < @, (see for example, Pfeiffer [5]) is still 
< F'(G(, 1)+1, 1). 


8. Applications to proof theory. Various consistency proofs and cut elim- 
ination theorems are carried through by means of an induction on the 
ordinal (in some well-defined sense) of the proof tree. Frequently an ordinal 
is coordinated with a tree by assigning particular ordinals to axioms and then 
specifiying for each rule of inference a function which determines the ordinal 
of the conclusion given the premiss and the ordinal of the premiss. In most 
cases this function can be taken to be one of the functions F?(0, — ) described 
here. For example, consistency of number theory is proved by using F’(1, —) 
fé, while to prove consistency of ramified analysis of ramification level 
do < @,, one needs {F'(B, —)IB < a +2}. 

We now briefly indicate how the functions described in this paper also 
can be used in place of ordinal diagrams to carry out Takeuti’s proof of 
the consistency of the system SINN. Let the definitions of ‘semi-isolated’, 
‘grade’ and ‘proof-figure with degree’ be the same as in [6]. Consider a 
Hilbert type second order system containing /-terms. Axioms are (1) tau- 
tologies, quantifier-free verifiable formulas, equality axioms, and 4-axioms 
(t € AxU(x) — W(t)) and (2) E-axioms, (ZT) > EXG(X), for number and 
set terms X, U where T is semi-isolated if T is a set term. Rules are induction, 
modus ponens, the E-rule: from (A) => K conclude EX9(X) = K; and 
set substitution: from %(A) conclude %(&). Takeuti’s proof (which, can be 
easily translated into this formalism) shows how a proof tree with endfor- 
mula 1 = 0 could be altered by a succession of reconstructions to yield a 
proof of 1 = 0 from tautologies and verifiable formulas. To prove that this 
procedure terminates, Takeuti assigns ordinal diagrams to the tree and then 
shows that the reconstructions decrease the ordinal diagrams. Instead of 
ordinal diagrams, one can assign classical ordinals as follows. Every axiom 
except the E-axioms is assigned 1 or 0; §(Z) > EX%(X) is assigned 


Bae nt) b): 


If the premiss of an E-rule has ordinal 0, the ordinal of the conclusion is 
0@1; if the major and minor premisses of an induction have ordinals @, «, 
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the conclusion is assigned «+ 0q. If the ordinal of % is «, § = © is @ and 
the grade of = 0 > grade of G = z, then the ordinal of & is F°(0, «@ 6). 
Finally, if the degree of a substitution is k < w and (A) has ordinal a, then 
%(Z) is assigned F°(F*(«, 1), 1). The crucial point in checking the correct- 
ness of this assignment is to notice that it leads to a decrease of the proof or- 
dinal during Takeuti’s ‘essential reduction’. This is a consequence of theo- 
rem 4.2. 

Notice that reference is made only to ordinals < G(w+1). By using the 
regular ordinals G(1), G(2),...,G(@+1), one can define the sets 
B*(G(@+1), k), 1 < k < @; the functions F* based on these sets would be 
sufficient to carry out the proof. Thus at least for the purpose of Takeuti’s 
theorem, we do not need the rather strong assumption of the existence of J. 

We have succeeded in pushing the construction much ‘further along’ by 
making use of Mahlo ordinals (see Gaifman [3]). What is still needed is a 
reformulation in a more elegant way which eliminates the tedious detail and 
case analyses. Perhaps the approach to this lies along the lines of Feferman’s 
recent work [2] or in the discovery of a suitable concept of normal form. 
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FORMALIZATION OF THE THEORY OF ORDINAL DIAGRAMS OF 
INFINITE ORDER ' 


AKIKO KINO 


In this paper we shall develop a formal theory of some systems of ordinal 
diagrams which have been used to prove the consistency of subsystems of 
classical analysis. By exhibiting a formal system in which these consistency 
proofs can be formalized, in particular one which suffices to prove the well- 
ordering of a given system of ordinal diagrams, one can use Gédel’s incom- 
pleteness theorem (in the form that no system can prove its own consisten- 
cy) to form an estimate of what the limits are of the power of such methods. 
For example, if we construct a formal system S in which the well-ordering 
of ordinal diagrams of finite order can be proved, and if we prove in some 
subsystem T of analysis that S is consistent, we know that T cannot be prov- 
ed consistent by ordinal diagrams of finite order. 

Takeuti introduced diagrams of finite order in [5]. In [9] he generalized 
them to ordinal diagrams constructed from well-ordered sets J, 4 and S and 
denoted these by Od(J/, A, S) (or O(/, A, S) for the system in the narrow sen- 
se). It is easily shown (cf. [4]) that all systems Od(J, A, S) (or O(/, A, S)) 
can be embedded in systems Od(J/’, A’, 0) with @ empty (or O(/’, A’, 9)), 
which we shall henceforth denote by Od(J’, A’) (or O(/’, A4’)) simply. In 
order to give the reader some idea of the strength of these systems O(/, A) 
let us recall that O(n, w) corresponds to ordinal diagrams of order n. When 
Tis infinite, even very simple, we get systems of ordinal diagrams sufficient to 
prove the consistency of a fragment of analysis: for example using the well- 
ordering of O(w+1, w*) we can prove the consistency of IT j-analysis (cf. 
Takeuti [11]). This motivates us to study how we can formalize, for given 
primitive recursive J and A, the theory of Od(/, A) in such a way as to prove 


1 Part of the work herein described was supported by NSF Grant GP-8918. The author 
would like to thank Professor G. Takeuti for his advice in the preparation of this work 
and Professor J. Myhill for his stylistic advice. 
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its well-ordering”. In §§ 1 and 2 is given the definition of Od(/, A) and its 
orderings <,(i¢ J) and a proof of well-ordering which is similar to the one 
in [5] as mentioned in [4]. 

If a relation < orders a set S and ae S, we call a S-<- accessible if the 
subset {b|b < a} of Sis well-ordered by < (in the sense of transfinite induc- 
tion). Then the following propositions can easily be shown: 


PROPOSITION 1. If every b < a is S-<-accessible, then so is a. 
PROPOSITION 2. If a is S-<-accessible, then so is every b <a. 


PROPOSITION 3. If a and b are S-<-accessible, then so is the natural sum 
a # b of a and b. 


The purpose of this paper, which is achieved in § 3, is to present a formal 
system which is in some sense constructive and in which the above mentioned 
proof of well-ordering of Od({/, A) can be formalized. We begin with in- 
tuitionistic arithmetic, i.e. Peano’s axioms plus primitive recursive definitions 
plus the intuitionistic predicate calculus. It is natural to take the notion 
‘~< is a well-ordering’ as a clear and secure one (cf. [2]): indeed it is custo- 
mary to add to intuitionistic arithmetic such a pair of schemata as 


ae SAVx(x < a> Ac(S, <, x)) > Ac(S, <, a) (1) 
and 
Ac(S, <, a)AVy(y € SAVX(x < y > U(x)) > ALY) = Ala) — (2) 


where < is a decidable relation (in particular a primitive recursive one) 
and S a decidable set which is the field of <. 

The resulting system is close to the intuitionistic system called IDK (cf. 
[13]) but without variables for constructive functions. Only number-varia- 
bles occur. We know from results of Howard [14] and Gerber [15] that this 
system, although much stronger from the point of view of provable well- 
orderings than intuitionistic arithmetic without the added schema, is not 
strong enough for the present purpose: for they showed that the primitive 


? Although only systems of the form O(/, A) are used in consistency-proofs and special 
cases of the fundamental conjecture of GLC, and although also O(/, A), a subsystem of 
Od, A), has a much simpler structure than Od(/, A), we seem nevertheless to be able to 
develop a formal theory of Od(/, A) in a theory in which a formal theory of O(U/, A) can 
be developed; in other words, Od(/, A) seems not strictly stronger than O(/, A) as far as 
consistency is concerned. Therefore we only deal with the systems of the form Od(/, A) in 
the following. The definition of OU, A) is obtained from the definition of Od(, A) by 
restricting a in 1.2 of § 1 to range only over A. 
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recursive provable well-orderings in that system are a subsystem of O(2) 
(the system of ordinal diagrams of order 2). The clue is to be found in the 
transition from propositions 1-3 above with Od(/, A) for S and <; for < 
which follow immediately from (1) and (2), to those same propositions 
with N, for S and <, for <. Here N; is the set of i-normal o.d.’s, a subset 
of Od(J, A) defined in § 2. (i-normal o.d.’s were called i-fans in [9].) 

This suggests that we strengthen (1)-(2) by letting S be not only the whole 
field of <, but an arbitrary (not necessarily decidable) subset of that field. 
It is known? that if we strengthen them in this way, allowing only arithmetic 
formulas for S, the system becomesa little stronger, but no new well-orderings 
can be obtained. But if we allow inductively defined notions (specifically 
the notion of an i-normal o.d.) to appear in S, the system becomes strong 
enough for what we want. 

To sum up: the system consists of (I) Heyting arithmetic, (II) the schemata 
(1)}-(2) for arbitrary S expressible in the system, and (III) the inductive 
definition of N;,(cf. NI-N2 in § 3) following Takeuti [10]. This is not an 
inductive definition of the restricted form usual in intuitionistic systems (for 
example in Troelstra’s system), but we still find it sufficiently constructive 
to be convincing in consistency proofs. In (1) and (2) we put <; for < and 
Od(/, A) and N; for S, and the whole proof can be formalized straightfor- 
wardly, as will be seen in § 3 below. 

Of course we are assuming that / and A are sufficiently small (e.g. J = 
w+1, A =? for ITj-analysis) that their well-orderings can already be 
proved in Heyting arithmetic. If we want large J and A of course we have 
to add (IV) axiom schemata expressing the well-ordering of those sets. 


§ 1. Definition of Od(/, A) and its linear orderings (cf. [9]) 


DEFINITION 1. Let and A be well-ordered sets. If there is no likelihood 
of confusion we shall use the same symbol < to denote the orderings of 
I and A; the symbol < as well as = should be understood according to 
the context. For simplicity let 0 be the first element of J. We call an element 
of Od(/, A) an ordinal diagram (o0.d.). The system of ordinal diagrams con- 
structed from J and A is defined recursively as follows: 

1. Every element of A is an o.d. 

2. If w and f are o.d.’s and ie J, then (i, «, f) is an o.d. 

3. If « and f are o.d.’s, then a + f is an o.d. 


3 This was communicated to us by W. Howard. 
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The rank of an o.d. « (r(«)) is the number of (’s and +#’s occurring in «. 


DEFINITION 2. Let a and f be o.d.’s andiel. B c,a (fis an i-subdiagram 
of a) is defined recursively as follows: 

1. If «eA, then « has no i-subdiagram for any ie J. 

2, If @ is of the forme(7, a, a; )) theny? <.0 ifefie.9, andgaae. 
or B is x, andi =j. « has no i-subdiagram if j < i. 

3) it a1s ofthe foum oo. thenviee alt fea a, Or fe, aoe 
(i-subdiagrams were called i-sections in [9]). 


DEFINITION 3. An o.d. a is called a connected ordinal diagram (c.o.d.) if 
the operation used in the final step of the construction of « is not #. 


DEFINITION 4. Let « be an o.d.. We define the components of a recursively 
as follows: 

1. If a is ac.o.d., then « has only one component Le. « itself. 

2. If w is an o.d. of the form a, + «, and the components of a, and a, 
are B,,..-, B, and y,,..., y, respectively, then the components of «, + a, 


are PR, £2, ee. 


DEFINITION 5. Let « and f be o.d.’s. We define « = f recursively as follows: 

1. Ife A, then « = f is the same as equality in A. 

2, If ais of the form (j, &), a), then — f if f is of the formi(j7cn 
With f= 9, 6) = foeandnege— age 

3. If wis a non-connected o.d. with k components «,,...,a,, thena = B 
if B has the same number of components f,,..., 6, and there exists a per- 
mutation (,,:254,) of (2...) such) thatie. —9) eiouen)— len a 


DEFINITION 6. Let « be an o.d. and let je 7. Then / is called an index of a if x 
has an i-subdiagram. 


DEFINITION 7. Let « and f be o.d.’s and let fe 7. We define the relations 
a <,f and « <,, Bf recursively as follows: 

1. Ifa, BEA, thena« <;B anda <, f meanag < Pin A. 

2. Let the components of « and Bf be o,,..., a, and B,,..., B, respec- 
tively. « <;f8 (i¢J oriis «) holds if one of the following conditions is 
satisfied: 

2.1, There exists a B,, (1 <7 7) stich that forsevery | (lp <a 
“, <;B,, holds. 

2.2, k = l.wi> 1 and ¢, — By fomeome m (MS 7 < 71) 
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2.3. k > 1,h > | and there exist ana, (1 < 7 < k) anda, (1 < m < h) 
such that «, = B,, and 


Oy te dE Oy Hy Ht ty, < By EO EB Ht Bmti Ho EB. 


3. Let a and f bec.o.d.’s and ie /. If there exists no index of « or f greater 
than 7 we define 7 to be oo, and if there exists any such indices we define 7 
to be the minimum of them. Then « <; f if one of the following conditions 
is satisfied: 

3.1. There exists an i-subdiagram f, of f such that « <; Bo. 

3.2. % <;f/ for every i-subdiagram a, of « and « <> f. 

4, Let « and f bec.o.d.’s of the form (i, a), «,) and (j, By, B,) respecti- 
vely. « <,, f if one of the following conditions is satisfied: 

Ny tea rae 

AD, = po andi = j. 

joc — Poet — 7 andi, oi. 


Oe toe Gilt, ded 7 (indy, ojala So oy. 


The following propositions can be easily shown: 
PROPOSITION 1. = is an equivalence relation. 


REOPOSMON 2 ce — Pie, — bo, (hen Waa, — fp, 4B, and 
Cc O1 > a>) = te By, B2). 
PROPOSITION 33, Ghia = Be > — B, and 0, <; Xo, then By <; Bo. 


PROPOSITION 4. Every relation <;(ieI or i = ©) is a linear ordering of 
Od(J, A). 


§ 2. Informal proof of well-ordering 


DEFINITION 1. If a relation < orders a set S, and ae S, we call a S-<-ac- 
cessible if the set of all b € S, b < ais well-ordered by <. Here ‘well-ordered’ 
is to be taken in the sense of transfinite induction (not well-foundedness). 
So what we have to prove is that every element of Od(J/, A) is Od(/, A)-<;- 
accessible for every ie J. 


DEFINITION 2. We define ‘an o.d. 8 is an exponent of an o.d. a (denoted 
B < «) recursively as follows: 
l. If ae A, x has no exponent. 
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2. Ii w is of the form a, 4... oe Whete O,,....,c, ale ©.O.dac mnie 
B <a if B < «, for some i(1 <i< m). 

3. If wis of the form (i, a, «,), then B < wif Bis a or B < a or B < ay. 
(Exponents were called ‘values’ in [9].) 


PROPOSITION 0. (Proposition 2 of §3 of [8]). If B <a, then B <q. 
The proof is omitted here. It should be noticed that it does not involve 
the notion of accessibility of o.d.’s. 


DEFINITION 3. The set N; of i-normal o.d.’s (called i-fans in [9]; this is the 
central notion of the proof) is defined as follows by transfinite induction 
on I: 

1. Every o.d. every exponent of which is Od(/, A)-< -accessible is a 
0-normal o.d. 

2. Ifi’ is the successor of iin J, then « is ani’-normal o.d. if « is an i-normal 
o.d. and every i-subdiagram of « is N,-<,-accessible. 

3. If 7is a limit number of J, then « is an i-normal o.d. if « is a j/-normal 
o.d, for every 7 = 1 ind. 

4. «is called an co-normal o.d. if « is an i-normal o.d.for every ie J. 


The following propositions are immediate: 


PROPOSITION 1. Jf every o.d. B <;a is Od(I, A)-<,-accessible, so is a. Here 
Mel Get == 06. 


PROPOSITION 2. Jf a is Od(J, A)-<;-accessible, so is every B <;,a. Here 
iélori= oo. 


PROPOSITION 3. Ifa,,..., %, are Od(J, A)-<,- accessible, so is a, # ... +O, 


These propositions hold also when we replace Od(J, A)-<,-accessibility 
by N,-<,-accessibility (called i-accessibility in [4]), where N; is the set of 
i-normal o.d.’s. 

This induction on a set we don’t know to be decidable will turn out to 
have importance for the axiom system in which we later formalize this proof. 

By propositions 1*-3* we denote propositions like 1-3 except that they 
refer to N,-<,-accessibility instead of Od(J, A)-<,-accessibility. 


PROPOSITION 4. If « is Ny-<j-accessible, it is Nj-<,-accessible, where i’ 
denotes the successor of i in I. 
Proor. We prove this by double induction on <; among N;-<;- 
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accessible o.d.’s and on the rank of an o.d. Let « be N;,-<,-accessible and 
assume as the inductive hypothesis on <;:: 

(1). For every N,-<j,-accessible o.d. 8 such that B <, a, B is Nj-<jac- 
cessible. 

We claim « is N,-<,- accessible. For this we shall show that every i-normal 
o.d, 8B <;% is an i’-normal o.d. and is N,-<,-accessible by induction on 
the rank of f. Let B be an i-normal o.d. such that B <;« and assume as 
the inductive hypothesis on the rank of f: 

(2). Every i-normal o.d. y such that y <; «and r(y) < r(f) is ani’-normal 
o.d. and is N,-<,-accessible. 

For any i-subdiagram fy of 8, Bo is an i-normal o.d. and is <; «. So fo is 
an i’-normal o.d. and is N,-<,-accessible by the inductive hypothesis (2). 
So f is an i’-normal o.d. Since 8 <;« by assumption, one of two cases 
holds: 

Case 1. B <, «. Then f is N;-<,-accessible by proposition 2* and so is 
N,-<,-accessible by (1). 

Case 2. There exists an i-subdiagram « of « such that B <; %). Since 
&%p is N,-<,-accessible and f is i-normal by assumption, f is N;- < ,-accessible 
by proposition 2*, 

Thus by proposition 1*, « is N,-<,-accessible. 


PRoposiTION 5. If « is Nj-<,-accessible, then « is N,-< -accessible for every 
GS 

Proor. This is easily seen from the following lemmas 1-3. 

LEMMA |. Let i be a limit element of I such that the following condition 
is satisfied: 

(3). For any j,k such that j < k < i, every N,-<,-accessible i-normal o.d. 
is N,-<-accessible. 

Then for every j < i, « is N,-<,-accessible if it is N;-< ,-accessible. 

PRoor. Let « be N;-<,-accessible. Let ip be the successor of the greatest 
index of « less than i. We should note that B <,;a if B <,,« and so «@ is 
N,-< ;,-accessible. Hence if j < ig then « is N,-<,-accessible by assumption 
(3). 

In order to consider the case ig < j < i we introduce an auxiliary notion 
‘y is an nth j-branch of B w.r.t. kg and k,’ (where ko and k, are arbitrary 
fixed elements in J such that ky < k,) defined recursively as follows: 

(i). Ifkyo <j < k, and y c, B, y is a first j-branch of B w.r.t. ky and ky. 

(ii). If y <,6 and 6 is an nth k-branch of B w.r.t. ko and k,, y is an nth 
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j-branch of B w.r.t. ky and k, if ky < j < k; and y is an (n+1)st j-branch 
ol Bb wit. kK, andie ite) ee 


We prove lemma | by transfinite induction on <; among N;-<,-accessible 
o.d.’s, using also induction on the number of branches w.r.t. ig and i. 
Let « be N,-<,-accessible and assume as the inductive hypothesis on <;: 

(4) Every i-normal o.d. B <,;a is N,-<,-accessible for every k <i. 
Since for every j < ig lemma | has been proved, it is enough to show that « 
is N,<,-accessible for every j such that iy < j <i. 

Let i) < j < i, B <;« and let B be a j-normal o.d. We shall prove that 
B is an i-normal o.d. and N,-<,-accessible for every / such that ip < / <i, 
by induction on the number of branches of £ w.r.t. j and i. 

Let j < k <i and let By be any k-branch of £. By the inductive hypo- 
thesis on the number of branches, fy is i-normal and N,-<,-accessible. Hence 
B is an i-normal o.d. and B <,« because B <;a. So f is N,-<,-accessible 
for every k such that k < i by assumption (4). 

Thus a is N,-<,-accessible for every k < i. 


By proposition 4 and lemma 1 we have 
LemMaA 2. For every ieI such that (3) holds and for every j <i, if a is 
N,- <,-accessible, then it is Nj-<,-accessible. 


Lema 3. For every ieé I, (3) holds. 
Proor. By transfinite induction on J using lemma 2. 

PROPOSITION 6. Ifa is No-< ,.-accessible, it is N;-< ,-accessible for every i € I. 
ProoF. Similar to the proof of proposition 5 using 


Lemma 4. For every pair j and k in I such that j < k, every N,- <,-accessible 
oo-normal o.d. is Nj- <;-accessible. 
Proor. Similar to the proof of lemma 1. 


PROPOSITION 7, Every co-normal o.d. is N.-< ,-accessible. 

Proor. The proof of this proposition is not difficult but rather laborious. 
We shall give a very brief outline, referring the reader to proposition 14 in 
[7]. 

Let sup(«) denote the successor of the maximal element of A occurring 
in a. Then one easily obtains 
LemMA 5. « <; sup («) for all 0.d.’s «, and for allie I and fori = o. 

So by proposition 2* it suffices to prove 
Lemma 6. [fae A, then a is N,-< ,-accessible. 
This is proved by showing that every c.o.d. « such that a <,, ais N,-<.- 


E XxIII FORMALIZATION OF ORDINAL DIAGRAMS 371 


accessible (considering all possible cases) and applying propositions 3* and 
i: 
By propositions 6 and 7 we immediately have 
PROPOSITION 8. Every co-normal o.d. is N;-<,-accessible for allie I. 
PROPOSITION 9. Every 0-normal o.d. is N;-<,-accessible for all ié I and for 
i = ©. 

Proor. This will follow from the following lemmas 7 and 8. 
LemMA 7. Every i-normal o.d. is N;-< -accessible. 

Proor. By lemma 5 and propositions 8 and 2*. 


LEMMA 8. Every 0-normal o.d. is i-normal for allie I and for i = o. 
Proor. By induction on / using lemma 7. 


Lema 9. If « is an o.d. which is not 0-normal, then there is a 0-normal o.d. 
B such that B <,. « and B is not No-<o-accessible. 
Proor. By induction on the rank of « using proposition 0. 


From this we have 
PROPOSITION 10. Every 0-normal o.d. is No-<-accessible. 


PROPOSITION 11. Every o.d. is O0-normal. 


By this and lemma 8 we have 
CoroLiary. N; = Od(/, A) for all ieI and for i = o. 


By propositions 10 and 11 we have 
PROPOSITION 12. Every o.d. is Od(I, A)-<o-accessible. 


THEOREM. Every o.d. is Od(I, A)-<,-accessible for all ieI and for i = o. 
ProoF. By the corollary of proposition 11 and proposition 9. 


§ 3. Formal theory of Od(J/, A). In this section we shall give details of the 
formalization of the argument of § 2. 

Let J and A be primitive recursive sets of numbers and <, and <, be 
primitive recursive predicates which well-order 7 and A respectively. We 
recall the definition of Od(/, A) and the accessibility-proof of Od(J, A) 
(§§ 1-2), and try to formalize it as described in the introduction. 

For the sake of convenience let J(a) and A(a) be the characteristic predi- 
cates of the sets 7 and A respectively; obviously they are primitive recursive: 
let o be the first element of J w.r.t. <,;. We are interested in the arithmetiza- 
tion of Od(/, A) and shall not always make distinction between notions 
and their arithmetizations. 
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1. Definition of Od(Z, A) and its linear orderings <;. We have introduced 
primitive recursive predicates I, A, <,;, <, and a constant o. Let us fix 
another constant oo for which —/(00) and define ab bya <,bv ((a)a 
b = 00), and I(i) by I(i) vi = oo. It is easily seen that the notions ‘a is an 
ordinal diagram (o.d.)’ (denoted O(a)), ‘a equals 6 in o.d.’s’ ‘(denoted = 
(a, b)) and the linear orderings a <;b (denoted <(i, a,b) for uniformity 
of notation) for each i (/(i) or i = co) can be introduced in a primitive 
recursive manner in Heyting (intuitionistic first order) arithmetic (cf. [7]). 
To be more precise let us fix a basic formal theory which we shall later ex- 
tend to a system containing new axioms and axiom schemata concerning 
accessibility. For the sake of technical convenience we choose a slight mo- 
dification of Gentzen’s LJ(first order intuitionistic predicate calculus [1]), 
enriched by arithmetical axioms and induction: A sequent I = A is called 
admissible if at most one formula in A contains quantifiers (so that all the 
other formulas in A are decidable). The basic system is obtained from Gent- 
zens’s LK ([1]) by requiring that every sequent in a proof be admissible 
and allowing sequents of the following forms as initial sequents (axiom 
schemata): 
1.1. Logical initial sequent: 


Be), 
1.2. Axiom schemata for equality: 


= daa) 


a = b, Mia) — Wd). 


1.3. Mathematical axioms (mathematische Grundsequenz in [3]), ie. a 
sequent [ = A with the following properties; every formula of [ and A is 
primitive recursive and contains no logical symbol, and every sequent ob- 
tained from [ = A by replacing all free variables in [ and A by arbitrary 
natural numbers (i.e. numerals) is true. 

1.4. Course-of-values induction: 


Vx(Vy(y < x > U(y)) > U(x) > Ua). 


In order to introduce primitive recursive predicates O(s), =(a, b) and 
<(i, a, b), several primitive recursive functions and predicates are used; 
among them a <; b (denoted c(i, a, b)) is important. We assume that any 
desirable properties of these primitive recursive functions and predicates 
are listed in the mathematical axioms (cf. [7] for examples of such axioms). 
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2. Accessibility of Od(J, A). We wish to add further notions to the basic 
system so that the system so obtained is strong enough to establish the acces- 
sibility of Od(J, A). 

Since we know as part of our assumption that J and A are well-ordered 
by <, and <, respectively, we can introduce accessibility of J w.r.t. <, 
and of Aw.r.t. <4 as axiom schemata (new initial sequents): 

For any formula (a) 


Al. I(a), Vx(I(x) AVy(y <x > U(y)) > A(x)) > Ala). 
AA. — A(a), Vx(A(x) AVy(y <4x > U(y)) > A(x)) > A(a). 


Let us now follow closely the accessibility proof in § 2. It is again clear 
that the notion a < b (a is an exponent of b) can be added in a primitive 
recursive manner to the basic system. Thus we are in a position to for- 
malize the two simultaneously defined notions ‘a is an i-normal o.d.’ and 
‘an i-normal o.d. a is N,-<,-accessible’. Now let us recall (1) and (2) in 
the introduction, to see how to introduce the notion of an element s of S 
linearly ordered by < being accessible in S w.r.t. < (denoted Ac(S, <, s)). 

To this end, it is convenient to introduce the abstraction operator { } into 
the system as well as a new higher order predicate constant Ac(#o, #,, *2): 
For any formula 2(a,,...,a,), which may contain other free variables, 
{x1,..-, Xn} W(x, .. 5 X,) is called an abstract with n argument-places. 
Ac({x}U(x), {x, y}B(x, y), a) is a formula if {x}U(x) and {x, y}B(x, y) 
are abstracts with one and two argument places respectively. We use the 
following abbreviations: 


Ac({x}U(x), a,b) for Ac({x}I(x), {x, y} < (a, x, y), 5) 
and 
Ac(a, b) for Ac({x}O(x), a, 6). 


The latter will mean that an o.d. b is accessible in Od(J, A) w.r.t. <,. 
Since O(a), I(a) and <(a, b,c) are primitive recursive, we can introduce 
Od(/, A)-<,-accessibility by 


AO1. I(j), O(a), ¥x(<(j, x, a) > Ac(j, x)) = Ac(j, a), 
AO2. I(j), O(a), Ac(j, a), ¥x(O(x) AVY(<(i, yx) > My) > A(x) => Aa), 


where (a) is an arbitrary formula. 
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We are now ready to introduce the central notion ‘a is an i-normal 
o.d.’ (N(i, a)) by an inductive definition w.r.t. 2. To formalize this, let 
O(b) be 

O(b) AVx(<(x, b) > Ac(o, x)), 


and let G({x, y}2(x, y), a, b) be 
O(b) A(UI(a) AVx(xZ a > U(x, b) AVy(<(x, y, b) 


> U(x, y)a Ae({z} U(x, 2), x, y))) 
v(a = OAVX(x = a > U(x, b))). 


Then we can introduce a new predicate constant N(i, a) using Takeuti’s 
kind of inductive definition in [10] (though our system does not contain 
second order quantifiers). 


Ni. (a), N(a, b) => G({x, y}(N(x, y)Ax = a), a, b) 
N2. Ia), G({x, y}(N(x, y)Ax & a), a, b) => N(a, b). 


Let Ac(a, b, c) be Ac({x}N(b, x), a, c) (which will mean c is N,-<,-ac- 
cessible) and introduce the axiom and axiom schema for N,-< ,- accessibility. 


ANI. N(i, a), Vx(N(i, x) A <(j, x, a) > Ac(j, i, x)) > Ac(J, i, a) 


AN2. —I(j), Ac(j, i, a), Vx(N(i, x) AVY(N(i, yA <(J, y, x) > U(y)) 
where (a) is an arbitrary formula. 

Naturally the notion of formula should be extended to include new pre- 
dicates in the axiom schemata (initial sequents) of the basic system, and the 
definition of a sequent being admissible should read: A sequent [ = A is 
called admissible if at most one formula in A contains quantifiers, the pre- 
dicate N or Ac. (Thus the other formulas in A are all decidable.) Finally 
for technical reasons we need an additional axiom schema concerning Ac. 


EA. — Wx(U(x) 4 B(x) > Ac({x} U(x), a, b) > Ac({x}B(x), a, b), 


where (a) and B(a) are arbitrary formulas and <> B denotes (U > B) 
A(& — W). 


3. Outline of the formal proof of accessibility. After the above preparation 
it is rather routine to carry out the formal proof of the accessibility of 
Od(/, A), though writing it down in full detail is naturally laborious. We 
have no essential difficulty in changing the formal accessibility proof in 


E XXIil FORMALIZATION OF ORDINAL DIAGRAMS 375 


[4] §4 into a formal proof of accessibility in the present system. Instead of 
giving technical details we make a few remarks to give the reader some 
idea of how the proof can be carried out, referring him to [7] and [4] §4 
for the details. 

First, we can use properties of primitive recursive functions and _pre- 
dicates freely in terms of mathematical axioms as in [7] (though we need more 
mathematical axioms than those required in [7]). 

Secondly, we can see as follows that no second order quantification takes 
place in the present proof: In [4] § 4 we presented a formal theory of Od(J, A) 
in which the notion of accessibility was introduced by making use of such 
second order quantifications as 


A(i, a, a) (to mean “a is accessible w.r.t. <; in {x|«[x]}’) 
es Vo(Vx(a[x] AVy(aLy] 4 < (i, y. x) > eLy]) > gx) > 9[e)), 


We also introduce the notion ‘a is an i-normal o.d.’ (there called i-fan) using 
inductive definition with second order quantifiers following [10]. Those quan- 
tifiers were required essentially only in order to express these two notions. 
We introduced the inference schema V elimination for a predicate variable. 


o(V), [=> A 
Voa(g), => 4 


under the restriction that %(«) contained no second order quantifiers. By 
going over the actual proof outlined in § 2, we verify that any principal 
formula of an V elimination for a predicate variable 8 occurring in the 
proof is of the form A(i, {x}2(x), a), where (5) is O(b) or F(j, 5). In 
other words, since here we have the predicate Ac expressing accessibility, 
we can obtain the lower sequent of 0 by reading Ac for A there and by 
making use of either AO2 or AN2 with other minor changes. 
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ON THE RELATIONSHIP BETWEEN TAKEUT?’S ORDINAL 
DIAGRAMS O(n) AND SCHUTTE’S SYSTEM OF 
ORDINAL NOTATIONS 2(n) 


HILBERT LEVITZ* 


1. Introduction. In connection with his proof theoretical investigations [13] 
Takeuti defined for each positive integer n a system O(n) which consists of for- 
mal expressions called ‘ordinal diagrams’ together with n+1 well-orderings 
of these expressions [12]. He showed the consistency of acertain subsystem of 
formal simple type theory by assigning ordinal diagrams to the proof figures 
and by arguing by transfinite induction on these ordinal diagrams that cuts 
can be eliminated from the proofs. He then went on to prove the consistency 
of even more extensive subsystems [16] by using ‘ordinal diagrams of infinite 
order’ developed in [14]. 

In this exposition we use O(1) to denote the set of ordinal diagrams of 
order n, and (O(n), <;> for 0 < i < n to denote the structure consisting of 
the set O(n) ordered by <;. If a set A is well-ordered by a relation <, then 
\|ue<A, <)|] will be used to denote the ordinal number corresponding 
to we A when <A, <>» is mapped by its ordering function onto an initial 
segment of the ordinals; ||(A, <)J|| will denote the order type of <A, <>. 

In the author’s doctoral dissertation [5] it was shown that ||(O(1), <,)I| 
is the least w-critical number (cf. [10]). This ordinal appears in Bachmann’s 
hierarchy of normal functions as ¢,,(1) (Bachmann’s functions have 1, not 
0, as the least element of their domain of definition). We also located Veblen’s 
least E-number ([9], [17]) within ¢<O(2), <,>. Under the assumption that 
Bachmann’s hierarchy satisfies certain closure, representation, and recursion 
conditions we showed that ||2 ¢ <O(2), <,>|| equals Bachmann’s g,, ,,(1). 


* The author wishes to thank Professors K. Schiitte and W. A. Howard for their encour- 
agement. 

This work was supported by a grant from the Office of Scientific Research of the United 
States Air Force. 
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Gerber [2] has shown that Bachmann’s hierarchy does indeed have these 
properties. 

Schiitte [11] has developed for each n a system of terms 2(”) and an order- 
ing < of these terms so that <Z(m), <> is a well-ordered structure. 
<Un<o (1), <> is not a well-ordered structure, but it does have a largest 
well-ordered initial segment (2), <)>. The principal result of this investiga- 
tion is that 


ue KO(m), <a> ll = IK205 <>Il- 


Pfeiffer [8] has shown how to delete terms from | ),<., 2(”) (by means of a 
restriction on the term formation rules) so that the resulting set ¥ is well- 
ordered by <, and <Z,, <> is an initial segment of <2, <>. He has also 
shown that ||(2), <>|| = a where a is the ‘Grenzzahl’ of his own extended 
version of Bachmann’s hierarchy [7]. In this extended version the initial 
ordinals @, (n < @) play a role. 

Isles [3], working independently of Pfeiffer, has developed an extended 
version of Bachmann’s hierarchy in which initial ordinals even greater than 
@, play a role. 


2. The system P*(n). In [6] we described a system O*(n) which differed 
from O(n) in inessential respects, and for which ||(O*(m), <;|| = 
= ||(O(n), <>|| when 0 < i < n. We then described a system P*(n) such 
that <P*(n), <,> is an initial segment of <O*(n), <,,> which in turn is order 
isomorphic to an initial segment of <P*(n+1), <,.,>, the latter initial 
segment being the one determined by [1, [1, 1]]¢P*(m+1)'. We find it 
more convenient to work with the system P*(n) because its members are 
built up using two place expressions [ , ] as opposed to the three place 
expressions[ ; , ]used in O(n). Below we show how P*(n) is defined. 
In addition we state some lemmas about P*(n) which will be needed in 
relating P*(n+1) to X(n). For the proofs of these lemmas the reader should 
consult [6]. 


DEFINITION 2.1, 

(a). Let n be a fixed positive integer. The expressions in P*(n) are called 
ordinal diagrams and are generated by means of the two operations + and 
[ , ]as follows (for brevity we call a member of P*(n) an o.d.): 


+ In [6] we used the notation (1, (1, 1)) imstead of [1, [1, 1]]; we prefer to use square 
brackets here because round brackets are used in forming the members of (n). 
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El 1. OeP*(n): 

E1.2. IfweP*(n) and i is an integer such that 1 < i < n, then [i, u]e 
P*(n). i is called the index of the o.d. 

E1.3. If u, ve P*(n), then so is u + v. 
(b). An ordinal diagram w is called connected if the operation used last in 
the construction of u is not +. That is, wis 0 or w is {i, v}. 
(c). The notion of component is defined recursively as follows: 

E2.1. If wis connected then uw has for its components only w itself and 0. 

E2.2. If u is of the form u, + uv, and the components of u, and wu, are 
X1,X2,-.-,X, and y,, y2,..., y, respectively, then the components of u are 
eG succes Me ics Pao ae 


DEFINITION 2.2. We define the notion of equality recursively as follows: 

E3.1. If all of the components of u are 0, then u = v if and only if all of 
the components of v are 0. 

E3.2. If u has a component different from 0, and if u,,..., u, are the com- 
ponents of uw which are different from 0, then u = v if and only if among 
the components of v there are precisely s of them v,,..., v, which are dif- 
ferent from 0 and there exists a permutation (7,,..., f,) of (1,...,5) such 
that u,, = v,,, form =1,...,5. 

Eos ie and eID ly it. xv) = J. 1]. 


DEFINITION 2.3. If u and v are o.d.’s and i is an integer satisfying | < i < n, 
we define the relation u <; v recursively as follows: 

E4.1. If vis 0 then u c; v does not hold. 

E4.2. Let v be of the form [j, v9}: 

E4.2.1. If j < i, then vu <; v holds if and only if u ; v9 holds. 

E4.2.2. Ifj = i, then u c; v holds if and only if uw is v%. 

E4.2.3. If j > i, then wu <;v never holds. 

E4.3. Let v be of the form v, + v,. Thenu c, v if and only if u <,; v, or 
Oe 


DEFINITION 2.4. We define the relations u <;v (0 < i < n) recursively as 
follows: 

E5.1. Let the components of wand v be u,,..., u, (kK 2 1)andz,,..., v 
(A = 1) respectively. u <;v holds if and only if one of the following condi- 
tions is fulfilled: 

E5.1.1. There exists v,,(1 < m < A) such that for every s (I < s < k) 
U, <, Vn 
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E5.1.2.k =1, h>1, u, =v, for a suitable m(1 < m< &#), andi 
for some t # m. 

E5.1.3. k > 1,h> 1, and there exists u, (1 ss<4 4) and vj(l 7a) 
such that u, = v,,andu, +... + ui, FE Uy, HE Ey, <p Dy 4 
FE Uni FE Umti FE ee FE Dg. 

E5.2. Let u, v be c.0.d.’s, Then vw <,9 (i = 1,.. 22,7) if andvonlyaf Gace 
of the following conditions is fulfilled: 

E5.2.1. There exists z such that u <,z C,v. 

25.2.2. jen? andezee | wimplies2-<; 0: 

E5S.3. If v has the form [j, y], then 0 <9 v holds. If u has the form [i, x], 
then u <j, 0 does not hold. 

E5.4. Let u and v be c.o.d.’s of the form [i, x] and [j, y] respectively. 
u <j, vif and only if one of the following is fulfilled: 

E5.4.1.i > j. 

E5.4.2.i>j and x <,). 


DEFINITION 2.5. We assign to each ue P*(n) a non-negative integer called 
‘the length of u and we denote it by Lu: 

E6.1. £0 = 0. 

E6.2. L(u + v) = Lut+Lo. 

E64. Li, x| = Exh 


DEFINITION 2.6. In the system P*(n) we define the notation /,, recursively 
as follows: Jp = 0, 1,41 = 1, + [n, 0]. Usually we write 1 instead of /,. 


LEMMA 2.7, ({6], lemma 3.3). Jf0 <i< k <n, thenx <,{i, x]. 


LEMMA 2.8 ([6], lemma 3.4). [i, x] <,Lj, y] if and only if at least one of 
the following holds: 

(a). k= OA Ge) 

(b). k > OA (fx) <p-ali vy] A(Gi> kv (i < kAx <; Li, y)))). 

(C.) KS OAs eA] 
LemMA 2.9 ([{6], lemmas 3.5, 3.6]) 

(a). Os k= i < no eand sx iy i x  ( 

(b). 0 < k <i < n,0nd Oj ine  lyik: 


DEFINITION 2.10. We define the functions f, for 1 < k < nas follows: 

CD). AOS. 

(2). f,(uy He os 2 ,) =e Be es Seconnecicdmmie:, 
SS. 
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[A, [j,x]] ifl <j<k, 
(Bye ls ay ic ei — sr ad veal, lee where pie 
[j, x] otherwise. 


LemMA 2.11 ((6], lemma 3.8]). Jf1<k <n, andOQ<p<k, thenu <,v 
implies that f,u <, f,v. 


DEFINITION 2.12. For 1 < k < n we define the functions F, as follows: 
(1) e020 
Cymer eve) er ied etter tet, 1S commected for 
ier = 5. 
O> ding ie 
x it 7 — sand x — 1 p,.y)) where p =k, 
GOB ="4 Wipe) +i, )| itk=j ands = [p,y] + Lat 
where p < k, 
[j,x] otherwise. 


LemMA 2.13. 
OQ). =x 
(b). f,.F,.x = x provided each non-zero component of x has index equal or 
greater than k. 
(c) fl <k<n, then u <,_,v implies F,u <,F,v provided each non- 
zero component of u, v has index equal or greater than k. 

PROOF. 
(a), (b), Immediate from defs. 2.10. 2.12. 
(c). Suppose not, then F,v <, F,u. But then using part (b) of this lemma 
together with lemma 2.11 we getv =f, Fv <,-1f,F,u = u, contradiction. 


3. The system 2*(n). The system 2(n) does not have an identity element with 
respect to the operation ++. Below we describe a system 2*(n) which has such 
an identity. In doing so it will be necessary to modify certain clauses in the 
definition of 2(n) which are not compatible with the presence of an identity. 
Failure to make these modifications would result in u < u + 0. 


DEFINITION 3.1 (a). Let 1 be a fixed positive integer. The members of 2*(n) 
will be terms defined recursively as follows: 
GIs The symbeles2,, (,,. . w@paresterms. 
G1.2. If u and v are terms then so are (u, v) and u + v. 
We sometimes write 0 instead of Q). 
(b). Terms of the form Q; and of the form (u, v) are called connected. The 
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notion of component is defined exactly as in clauses E2.1 and E2.2 of §2. 


DEFINITION 3.2. To each term u we assign a non-negative integer called the 
length of u (denoted by Lu) and a non-negative integer called the Jevel of 
u (denoted by Su) recursively as follows: 

(2M, A 0), 

G22 07 — Aion Wena 

G2.3. L(u,v) = Lu+Lvo+1. 

G2.4. Lu, +... # u,) = Lu, #...# Lu, for connected terms 
aimee v7 Usa) D) 

G25. SO; —9 (7 Osbee): 

G2.6. S(u, v) = Sv. 

G2.7. S(u, #... + u,) = max {Su,,..., Su,} for connected terms 
ee easels al (arene |B 


DEFINITION 3.3. We define equality among terms recursively as follows: 
Goa Of= OG = Oalee)s 
G3.2. u; =v, and 4, — ep imply (4, %))— (01, 2). 
G3.3. Same as E3.1. of § 2. 
G3.4. Same as E3.2. of § 2. 


DEFINITION 3.4. We define the set of i-coefficients K,;u of a term u recursively 
as follows: 

G4.1. If wis connected and Su < i, then K,;u = {0, u}. 

G4.2.If0 <i<k <n, then K,;Q, = {0}, that is the set whose only 
“member is the symbol 0. 

G4.3. If i < Sv, then K,(u, v) = K;u U Kyv 

G44. Ku, + .. ashy) = Ky Oe OK, for wie... Connected 
(= 1): 


DEFINITION 3.5. We define the relation < on 2*(n) recursively as follows: 

G5.1. Ifi < Sv(y= 0) oe athens 2 (en): 

G5.2. If wis connected and Su <i (i =1,...,7), then vu < Q,. 

G5.3. If u, < v,, Su, =1, ande (Aga, © {u,)) implies x <= (cpeey 
then (u,, v2) < (v1, v2). 

G54. Ifu, =v, and wo eaten at) eae 

G5.5. Ifv, < u,, Sv, = i,and (u,, u.) < x forsome x € (K;v, vu {v2}), 
then (uw,, uz) < (v1, v2). 

G5.6. Same as clause E5.1. of § 2, reading < instead of <;. 
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DEFINITION 3.6. We define the expression /,, and the functions 1+x, —1+ x, 
and Q;+x for0 < i < nas follows: 
(1). Jo = 0.41 = 1, + (0, 0). We will normally write 1 instead of /,. 
x+#1 ifx =/, for some m, 
(2). l+% = * otherwise. 


a, 
Oe as ' otherwise. 

x if x has a component (uw, v) where Sv > i, 
Ny Opies = 
Ce: a 4+ x otherwise. 


The following lemma states some elementary consequences of definition 3.6. 


LEMMA 3.7. 
(a). Sx = iimplies x = Q;+y for some y. 
(b). x < y implies Q;+x < Q;+y. 


DEFINITION 3.8. We define the sets C; and the functions 4; for0 < i < nas 
follows: 

(1). x € C, if and only if x = Q; where 0 < j < i or x = (u, v) where u 4 0 
and Sv < i. 

x ifxeCc,, 
Cy (0 at — pe, and ye C., 
(0, x) otherwise. 

LEMMA 3.9 (Simple consequences of def. 3.8). 
(a). A;,x is a strictly increasing function of x. 
(b). For a given i, (0, x) = h;y where y = x ory =x #1. 
(c). If Sw < i, then w = u + v where each non-zero component of u is of 
the form h,z for some z, and each non-zero component of v is of the form 
(x,y) where x # 0 and Sy = i. 
(d). x < (1+, v) implies h(x) < (1+u, v). 
(e). x <= hp 
(f). Ifu#é 1 and Sx > k, then ue K,x implies ue K,h;x. 
(g). Aix < (0, h;x) for any i, j. 
(h). i> jimplies h,x < hx. 
Lemma 3.10. 
(a). v < (u,v). 
(b). Sv =i and xe K,u imply x < (u,v). 

ProoF (Cf. [11], lemma 9.). The proof given there is for Z(m), but it 
applies also to 2*(n). 
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The following recursively defined mapping ’ maps 2(n) onto {x|x € Z*(n) a 
Ax # O}, and preserves order: 
(1). Ga 4. 4 u,) =a, Ese, if uw, connected for |< 7e 
(2). 1’ = (0, 0). 
(3 ee torre we 
eyo) = ee 
(5). (x, vy)’ = (-14+2', -14+y’')ifx 4 1. 


4, The principal mappings. In this section we describe an order preserving 
mapping from <P*(n+1), <,4,> onto an initial segment of <2*(n), <). 


DEFINITION 4.1. For 0 < k < n+1 we define the mapping B, from P*(n+1) 
into 2*(n) as follows: 
lite = [ele wheter 


rites li otherwise. 


DEFINITION 4.2. For 0 < k < n+1 we define recursively the mappings g, 
and H,, from P*(n+1) into 2*(n) as follows: 
(1). gx(0) = H,(0) = 0. 
(2). gu(uy tt Uz AE... AE Us) = Gully HE Gyn HE... AE IRUss 
A, (uy dt ug dt ooo dt u,) = Au, aie Au, aie eee dt Af, u, if u; is con- 
nected for ) =< 7 =< 
Oil 7 2ye 
(3). A, fi, x] = Hen (paren ifi=k-1, 
G16) tf < /k=l|, 
UBS Ce oe leg) i eae, 
pales x] = (1+A;,x, eer (Dew dt IF x)) ifz = k—l, 


It will be seen later on that g,, is the order preserving mapping referred to 
in the opening remark of this section. 


LEMMA 4.3. Assume 0 < k < n+1, then: 
(a). Sg,[i, x] = n+1— max {k, i}. 
(Bb). © p44 = Gg, ee) pronided are 
(c). If i > k and g,[i, x] = (u, v), then u = 0 or S(u, v) < n+1—k. 
(d). Uf wu = 9,41-,(v), then if k* < k, there exists v* such that 
U = Gn+i—qr(0*). 
(¢). S21 and 52k imply Glow = Gnas ol 
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(f). s >i and s >j imply g,{s, x] = g,{s, x]. 
(g). i<j <8 implies g;[s, y] < g,{s, y]. 
PROOF. 
(a). By induction on Lg,[i, x]. 
(b). Immediate from part (a) together with def. 3.5. 
(c). Immediate from def. 3.8. 
(d). We can assume that v is connected. If v = 0, then trivial, so we can 
further assume that v has the form [j, f]. 

(dl) k= Thentrivial; 

(2) kee ke: 

(d2.1). j < n+1—k*. Then using def. 3.8 and lemma 4.3(a) we see that 
es 0) — Pee geet ul io) Gasol l—k”, [7,4]: 

(d2.2). 7 > n+1—k*. Then k > k* > n4+1-j: 

(d2.2.1). g;t is of the form (p, q) # /,, where p # 0, S(p, q) = k*. Then 
using def. 3.8 along with the fact that g,(t +1) = g;t #1 we get 
9n+1—-wL Jj, 4] = hy(Q,41-7 +958) =h(Q,41-;+9;t #L) = Gatton! ot: UE 

(d2.2.2). otherwise. On account of n+1—j < k we get that Q,4,-j;+9;¢ 
is not of the form 2, + /,, for some m, so 
In+1—KeLJr t] = Irye(Qn41—7+95t) = WM Qne1— 74954) = Ino. cL, ¢). 

(ec). Follows from lemma 3.9(g). 

(f). By def. 3.8 we would have under the given hypothesis that 

fine r= 2nti-st+9sY) = Mas eee oy). 

(g). By lemma 3.9(h) hy41-(Qn41-s+9sX) < Mn+ 1—-{Qn+1-st 9s) 


DEFINITION 4.4. We define the projection functions R and Q as follows: 
R(x, y) = x and O(x, y) = ». 


LEMMA 4.5. Assume 0 < k < n+1 and x* is a component of x, then: 
(a). Il <j<k—l, then ue K,4,-,Ro,{j, x*] implies that 
UE Ky41-R9xL/, *1. 
(b). 1 <j< ntl, then g,[j, x*] < 9,[, x]. 
PROOF. b 


(a). The trick is to show more generally that for 1<t<k-j, 
ué Ky+1-,Rg;++L7, x*] implies ue K,+1-,Rg;+:[7, x]. This can easily be 
done by induction on ft. One considers cases t = 1, t = 2, and ¢t > 2. The 
induction hypothesis is only used in the latter case. 

(b). This is done by induction on k making use of part (a) of this lemma and 
Iemma 3.10. The cases to be considered are j >k, j = kK—1, andj<k-—1. 


386 H. LEVITZ E XXIV 


The induction hypothesis is used only for the latter case. 


LEMMA 4.6, Suppose 0 < k < n+, then: 
(a). oft. Z] < hati—x(orLt, Z]) provided 1 <t<k. 
(OD) Gh? ee eeeape?): 

PROOF. 
(a). From t < k we get by lemma 4.3(a) that Sg,[¢,z] = n+1—k. From 
t < k we also get using def. 4.2 that g,[t, z] has the form (u, v) where u # 0. 
So by def. 3.8 h,.,-,(9,[t, 2) = (0; go. [6 2): 
(b). Ay+1—¢(2,+1-n+9,%) 18 connected and by lemma 3.9(a) 
Ain 1 eC Quote GeX") < Myer —n(Qn+1—~+9xx) Whenever x* is a compo- 
nent of x, so it suffices to show that g,x* < hyy1—,(Q,41-4+94x*) for 
each component x* of x. 

(b1). gx* < O,44-.49,2°. Then using lemmar3-9(d) 
GX" < Qne1-n+ GX < iy 41 —-e(Qn41 K+ In X"). 

(b2). 9,x* = Q,41-~+9,x*. Then it suffices to show 
Gee pean (Gem 
Noting that Sg,x* >n+1—k we see from def. 3.8 that (0, 9,x*) = 
= Inti GX") 
LemMa 4.7. Jf 1 < k < n+, then: 
(a) 1l<,yAy=[,x] A i2 Kimplies Wry: 
(b). 1 <, yandy connected implies 1 < g,y. 
(c). OFyy = 1, for some t > 0 implies that y = z + 1, where each non- 
zero component of z has index less than k. 

PROOF. 

(a). Immediate from def. 2.12. 
(b). Immediate from def. 4.2. 
(c). Suppose not, then we obtain a contradiction by using (a) and (b). 


LEMMA 4.8. Assume 1 < k < n+1, then: 
(a). GFX < Gx. 
(BD) gin <6; eed 
PROOF. 
(a). It suffices to consider the case where x is connected: 
(al.). x = 0. Then trivial. 
(a2). x = [s, y] for some s and some y: 
(az). Fx = x. Wheneg shea 
(a2.2). Fx = y. Then by def 2.12 we sce thats — 4 ands 02 50 by 
lemma 4.5(b) 9, FX = Ge¥ < Inet —e(Qno1—nt IY) = lk, Y] = 94x. 
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(42:3). fpe— ko olewhetem- le — ye when by del. 2:12 we sce that 
s=k and g,F,x = glk, v) = Mya 1—(Qnt1—et GY) < Ang 1—1( Que 1-0 + 
+9(v # 1)) = alk, y] = 9x. 

(b). 

(b1). F,[k, x] = [k, x]. Then if x = 0 our result is trivial, while if x 4 0 
weeecet sbyemlemmam (Dlg x <a, sac pct ox) — 9, [ks ool 
= 9. F[k, x). 

(2) eke x — 2 Chen g,— 9, Fk, x |e 

(b3) 92, [ko x)= [k, vl] where ve =x. By def. 2.12 we see that » 4 0, 
$0 g,X = Gv #1 < Aye —e(Qre 1+ HP) = 91K, v0] = 9 F, [k, x]. 


LEMMA 4.9. 
(a). fils k-l <n and 1 <i=k-—1, then 1 < Rg,{j, fi, yl]. 
in addition y an i” then Rg,{j, [i, y]] is connected. 
(b). 1 <j<k-1 <n, then g;,,[k, y]€ Ky41-.R9xLi, [k. y]] provided 
that2 <t < k-j. 
(cj) if l= j =< k—1 & mm then g, (ks y)/= 2,[/, (k; y]]. 

PROOF. 
(a). Immediate from def. 4.2. 
(b). We first note that under the given hypothesis ae Fyiilks y] = 
= gj;4:[k, y]. So it suffices to show that 9,4, F)411k, ¥] © Kn41-¢RGj4iL), 
[k, y]]. We show this by induction on ft: 

(b1). ¢ = 2. Then RGj+ebd, (AV) = 94104 [Kyl = ae oe iy 
+ (Bj +1[k, y] # 9;+1 Fj) +11k, y))). Since S9;41Fj+41[k,y] =n+1—k we 
get SS Sle aay Aiiiris se with rr (fatal = 
> nt+1—k gives 9541 Fj+1[k, ¥] © Kyo 1-4 R9j4L4, [Ky]. 

(b2). t =r+1 for some r > 2. By induction hypothesis 
Gj Fj4ilk, V1 € Ky41-2Roj+-L), [k. yl]. But Sg, (A. yJ]] =n+1+ 
—(j+r) >n +1—k, so 9541 Fj41[k, ¥) © Ky41-29;4+-L, [k, y]]. It remains 
only to note that 9j4,L/, [kyl] = RojseLi, Uk yIl. 
(c). glk, IE Kn+1-«R9xLj, [k, y] by virtue of part (b) of this lemma. But 
S9,U); [ky] = n+1—k, so g,[k, ¥] < gL), [k, y]] by lemma 3.10. 


LemMaA 4.10. 
(a) I l<i<k-lLi<s<n+l, 0<p<n+t+1-k, and u ¥ 1, then 


ué Kgi+1 Fisi[s, y] if and only Jes Jil; y). 
(b). Pl<j<i<nt+l,0 <p <n+1-i, and u# 1, then ue K,g;x 
implies u € K,9;+1Fj+1[, x]. 
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PROOF. 
(a). 

(al). i+1 <s. Then F,4,[s, y] = [5,9], 80 9:41 Fi+1[5, ¥] = 9:41[5, y]. 
But g;+1[s,y] = gils, y] by lemma 4.3(f). 

(a2). i+1 =s. Then n+1—s =n+1—(i+1) > n+1-k 3 p: 

(a2.1). y is not of the form [#, w] + 1, where t < i+1 and m > 0. Then 
F;41[8, y]=[s, ¥] 80 9:41 Fi+i ls, yl=gierls, VY) =My41—(24:1)(Qn41-st9sy): 
Therefore since u # 1 we get using lemma 3.9(f) that ve K,gi+1 Fi+1[8, y] 
if and only ifu € K,g,y, andu e K,g,y if and only if uw € KyAy41-(Qa41-s+ 
+49,y). It remains only to note that h,+1-;(Q.41-st+9s¥) = gils,y]- 

(a2.2). y is of the form [t, w] + J, where t < i+1landm > 0; 

(a2.2.1). m = 0. Then Fy+,[s,¥] = y 80 9:41 Fi+1[s, ¥] = gisiy. Now 
since i+] = S, 9;[5,¥] = Ano1-(Qnr1-st9sV) = hae 1 —-i(2n41-(641) + 
+ 9i+1)). From this it follows using u#1 and lemma 3.9(f) that we K,9;41 
if and only if we K,g;,[s, y]. 

(a2.2.2), m > 0, Then F,, ;[s, vy] = |[s, vl) where @ =) = Ths 
9is1F iss [9,9] = Git l8, 0] = Past -cGti(Qnt1-G+1y+9i410)- On the 
other hand g,[s,v] = My41-(2,41-G41)+9i4+1¥), SO Since u ¥ 1, 
ué K,g;+,[s, v] if and only if we K,g;,,v, and ue K,g,,,v if and only 
if ue K,gi4i1y, and u€ K,g;41y if and only if ue K,g,[s, y]. 

(b). 

(b1). Fj+1, x) = [i, x]. Then 9,41 Fj4i[i, x] = gj411 x] = 
= haa1—g+1)(Qr+1-i+9i%), thus since u #1 we have by lemma 3.9(f) 
that ue K,g;x implies ue K,g;+1F;+:[i, x). 

(62). F,..[i, x] =. For this te vhappensy—7-- i but then 
9j+1F 5411, x] = gix. 

(b3). F;4,[, x] = [i, v) where v + 1 = x. For this to happeni = j+1 so 
9541 Fj 41 Lb ©) = opi fi, 0] = Ina —G41)(Qn41-i+ giv). Now since u # 1 
we get u € K,g,;x implies u € K,g;v, but then we are finished by lemma 3.9(f). 


Lemna 4.11. 

(a). Ifl<j<is< ntl, then g;x < 9)41F)+,[i, x). 

(b). If 1 <j<i< n+l and n+1-i< p, then ue K,g,x implies u< 

< 9541 F +104, x]. 

(c) If 1 <j<i<n+1 and n+1-i< p, then Sg,,,F,.,[t, x) < p. 
PROOF. 

(a). 


(al). Has) be x] = [i, x]. Then Gj+1 Fi 41h, x]= fine —G41)(Qn41-i+ 92%) 
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and the desired result follows from lemma 4.6(b). 

(a2). F,4,[i, x] = x. For this to happen it must be the case (cf. def. 2.12) 
that f = j+1 so gj41Fja1.X = 9GiXx. 

(a3). F,41[i, x] = [i, v] where x = v # 1. For this to happen it must be 
the case that i = f+1, so 94, F)41:[i, x] = g;[i, 0) = Ayo 1-(OQn41-7+-9;0)- 
If v =0 then x =1 so the desired result would be trivial. If v 40 
then g;0 < hj44-(2,+41-;+9,¥) by lemma 4.6(b), so g,x =g;0 #1 < 
< hyn -i(Qn41-i+ Gi). 

(b). We first note by lemma 4.3(a) that Sg;x < n+1—i < p,sou = g;x* 
where x* is a component of x. So using part (a) of this lemma u = g;x*< 
<GiX < Gps Fj41[i, x]. 

(ce) 

(cl). Fj: [, x] = [i,z] for some z. Then by lemma 4.3(a) we have 
Sgj+i[i, x] =n+1—i < p. 

(c2). F,4,[i, x] = x. For this to happen? = j+1 and x = [s, z] for some 
z and some s < i. Then by lemma 4.3(a) Sg,+,[s, y] = n+1—i < p. 


LEMMA 4.12. If 1<i<k-l<n, 0< p<n+1-k, then l#AueE 
K,Rg,{i, x] implies that u < ve K,g;x for some v or that u=hw for 
some s and some w ‘such that Sw < p and for which zé K,w implies 
we Kg ix. 

Proor. By induction on k—i: 

(1). k-i = 1. Then Rg,[i, x] = 1+H,x, since u # 1 we can assert that 
ué K,H,x. From this it follows that ue K,H,x* where x* is a component 
ol celiege— O1or ~  — [s,y| iors = & then 4,x* — 0 and our desired 
result would be trivial, so we can assume x* = [s, y] where s < k—-1. 

(1.1). s<k-1. Then H,[s, y] = 9,-1[s. y] = g;[s, y] thus ue K,g;x*, 
sowve Kg) x. 

2 seer thene A iscy) — Meno Geeiy): 

(1.2.1). Shy41-ce-1)(Gx-1Y) > p- Then ue K,g,-,y. But fromi=k—l=s 
it follows that g;[s,y] = /tya1—ce-1)(Qn41-(e-1) +9x-1¥), then using 
lemma 3.9(f) together with the fact that wu # 1 we get ue K,g;[s, y]. 

(12:2). Sh, een (Gee yy) p. Then uh, 1—G@-1)(Ge-1))- We see 
then that g,_,y plays the role of w mentioned in the statement of this lemma, 
for zé€K,g,-1y implies z€ Kyliy41—(-1)(Qn4+1-(e-1) + Gx-1¥) While 
Frys 1 —(e-1)(Qat 1-1) + GJe-1Y) = Gu-1[kK—-1, ¥] = Gils, ¥] = gix*. 

(2) pare — 2 hen weg, lt, ak 9p, X) = (1 Ay 4%, Qpei-aei 
+(B,-1% # Gx-1 Fy-1%)) sou e K,H,-1x orue Ky g,-1 Fy-1X: 
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(2.1). we K,9,-1F,-1x. Then ue K,9,-1F,-1x* where x* is a con- 
nected component of x. If x* = 0 then u = 0 and our result is trivial, 
sO we can assume that x* is of the form [s, y] where s > k—1 (ifs < 
<k—-1,9,-,;F,-,x* = 0). Recalling thatk-—1 =i+1andu¥ 1 we have by 
lemma 4.10(a) that ue K,g;x*. 

(2.2). we K,H,-,x. In this case we can proceed in the same way as in 
case 1, except that we replace all occurrences of k—1 by k—2. 

(3). i<k—2. Then Rg,[i,k] = 94-1 li, x] = (9-2li, x], 0, 41=( ee 
Since n+1—(k—1) > n+1—k 2 p we see that we K,Rg,[i, x] implies 
ue K,Rg,-;[i, x] so we have the right to employ our induction hypothesis. 


LemMa 4.13. Jf 1<i<k—-l<n, 1<j<m-1<k-l, 0<p< 
<n+1—k, and u # 1, then ue K,g;x implies u < ve (oy. [i, x]] vu 
U K,Q9mli; li, x]]) for some v. 

Proof. We argue by induction on m—j. u € K,g,;x* for some component 
x* and we can assume that x* # 0,so x* = [s, y] forsomes and some y: 

Ce — ie 

(1.1). 7=j. Then g,,[j, fi, *]] = gia. lt, (i, x)) = Gasiei9ix, Qe 

(1.1.1). Sg;x > p. Then since by hypothesis u # 1, we get by lemma 3.9(f) 
ue Kyhy41-iGix- 

(1.1.2). Sg,x < p. Then u = g,x* < g,x = hy41-:9:x € K, Ro pli, li, J]. 

(22) ta then? aie 9j+i14 Li, x]] = (9;li, x], 2,41-m # 1). 
It suffices to show that u < ve K,g;4,[i, x] for all ¢ such that /-i > ¢ > 1. 
We do this by an ‘inner induction’ on f: 

(1.2.1). p= 1. g,+,li, x] = 9:41.18, x) SM 4 Bax, OM 
+ (Ba. # 9:41 Fi41x). In view of the fact that p < n+1-—(j+1) < 
<n+1-—(i+1), it would suffice to show that u < v € K,H;,,x* for some v 
or that u < ve K,gi41 F;4,x* for some v: 

(2.1.1), soe. Then fea. ee 

(1.2.1.2). s=%. Then g;x* = 4,79%,(2,4,—-,-+ 9,)). But thenfromm ee 
—i> p we see that ue K,g;x* implies u € K,g,y. Furthermore H;,,x* = 
= h,+1-i9i¥, 80 we need only show that uw < ve K,h,,,-;9;y for some v: 

(1.2.1.2.1). Sg;v > p. Then since u # 1, we have by lemma 3.9(f) that 
ué Kyhys1-iGiy- 

(1:281.2.2). Sg,y = p. Then wagy = hee Bul hj ae 
Kyhtae1-iG iv: 

(1.2.1.3). s > i. Recalling that i <j < m—1 < k—1 we see using uv # 1 
together with lemma 4.10(a) that we K,g;41Fi+,[s, y]. 
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(1.2.2). t > 1. So we write ¢ = r+1 for some r > 0. Now g;,,[i, x] = 
= Gi+e+i lt, x) = (Gi4,[6 x), @.41-c+n)- By the inner imduction hy- 
pothesis we K,g;4,[i, x]. This with p < n+1—(i+?) gives ue K,g:4,[i, x]. 

(13) 7 = Now Sgr, x)= 0; LPS eres) oe (Bil, x] + 
+ (9j41 Fifi, x]))): 

(1.3.1). 2+1—i > p. Then by lemma 4.10(b) we get u € K,9j41F)+,[7, x]. 
filtat is ved Og (iy, (2.1): 

(1.3.2). n+1—i < p. Then by lemma 4.11(b)u < g;41Fj+:[i, x]. By 
lemma 4.11(c) Sgj41Fj4i1li, x] < p 80 gj41Fy4ilt, x) © Kp9j41 Fjoi lt, x1, 
that is, gy+1 Frail, x] € Kp Q9nli, li, XI]. 

(2) ae by Beng, (it. |= (geet, tt. xl), 2,41_,,)- By induces 
tion hypothesis u < v € (K,Rgm—1 1), [i, x]] U K,Q9m- 1), [i, x]]) for some 
v. But Sg,,—1Lj, fi, x]] = 2+1-—(m—-1) > n+1-k 2 p, so ve Ky gn-il/, 
eel) Thus ve kK, Rg), (t,x). 


LemMaA 4.14. /f 1 <i<gk—-l<nand|1<j<k-—l, then ueK,.,_,R9,li, x] 
implies u < g,{j, [i, x]]. 

PROOF. 

(ew iheniv< 9,17, [ix by dels. 42 and 3.5. 

(2). u > 1. By lemma 4.12 u < ve K,,1-,9;x for some v, or u = A,w for 
some s and some w such that Sw < n+1-—k& and for which ze K,4,_,w 
implies z € K,41-,.9;X: 

(2.1). uw < v€ K,+1-49;x for some v. By lemma 4.13 

v < we (Kya -cRoeLs, [EX] UV Kae 1-1 OGL, Fi. x))) 
for some w, but w < g,[j, [i, x]] by lemma 3.10. 

(2.2). u = h,w for some s and some w such that Sw < n+1—k& and for 
which ze K,,,-,w implies ze K,.,-,.9;%: 

(2.2.1). A,w = w. Then we K,4,-,w so WE K,4,-,9;x. Then we have 
only to use lemma 4.13 followed by lemma 3.10 to get w < g,[j, [i, x]]. 

(2.2.2). hw = (0, w) or k,w = (0, t) where w = ¢ + 1. Then Rg, hw = 
—tORceg ali il Morcoverze K,.,=, Ow implies 2e K, 212,9;%, 80 
by lemma 4.13 followed by lemma 3.10 we get z < g,[j, [i, x]], thus hw < 
< Il; [i, x]]. 


LemMa 4.15. [fl <j <k < nt], then g,x < g,[/, x]. 

Proor. By induction on k. Our induction hypothesis is that if k’ < k, 
then for every j and every x,j < k’ implies g,.x < g,[j, x]. Let j, x be 
given where j < k. Since g,[/j, x] is connected it suffices to show that g,x* < 
< g,[j, x] for every component x* if x. Moreover if x* = 0 our result is 
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trivial, so we can assume x* is of the form [i, y] for some i, y. By lemma 
4.5(b) it would, in fact, suffice to show g,[i, y] < 9,L/, fi, y]]. 

(1). 7 =k. Then using lemma 4.6(b) 9,.[i,] < Ag41—i(Qn41-nt+ Geli, VIV= 
= ILI; li, Dale 

(Q)ag <ais: 

(2.1). i>k. Then Sg,[i, y]) = n+1-i<n+1-k = Sg,{j, fi, y]] so 
finished by def. 3.5. 

22a 

(2.2.1). j = k—1. Using lemma 4.8(b) and lemma 3.10 we get 
Qaet—et GY < Qeer—-n+ (Beli, Vv] + FL VI) < 

< (1, Qu41-4+ Bly] + Fi y)) = aL, Gy). 

(2.2.2). j < k—1. Then by lemma 4.9(c) g,[i, y] < 9,L, fi, y]]- 

(23) ek: 

(2.3.1). 1<k—-laj—k=1. Then Ro, [yl — eerie ll — eo le 
and Qg9;[i,¥] = Qa41-1 < Qu+1—-, #1 = O91), Eyl] so by def. 3.5 
9xli, V1 < GL, Li, vy]. 

(2.3.2), i< k-LaAj<k—-1 ori =k—-laj<k—1. We now assert and 
will show that Rg,[i, y] < Rg,L, [i, y]]. We prefix by 2.3.2.A the various 
cases which arise in showing this assertion: 

(2.3.2.A.1). i< k—Laj < k-1. Using induction hypothesis we see that 
Rgx li, ¥) = 9-116, 9] < 94-1; [Ev] = Ro, Li, vy). 

(2.3.2.A.2). i= k-lLaj<k-—-1. Then Rg,[i, vy] = 14+H;,y: 

(2.3.2.A.2.1). y = 0. Then 1+H,y < Rg;,[), [i, y]] by lemma 4.9(a). 

(2.3.2.4.2.2). y#0. Now by lemma 4.9(a) Rg,[j, [i,y]] > 1 and is 
connected, so it suffices to show that H,y* < Rg,|j, [i, y]] for every non- 
zero component y* of y. Let y* = [s, v] be such a component: 

(2.3.2.A.2.2.1). s > k—1. Then H.y* = 0 so trivial. 

(2.3.2.A.2.2.2). s < k—1. By two successive applications of the induction 
hypothesis we get g,-10 < g,-,[5,0] < 9,-1[i, [s, v]]: 

(2:3.2,A.2.2.2.1), 7 =e = I ene gelato alee ater p= 
= 1th -p-j Galley eae a 

(2.3.2.A.2.2.2.1.1). s = k—1. Then by lemmas 3.9(a) and lemma 4.5(b) 
Ayy* = Ay [s, 0) = bas. —ce—1)(Ge-1?) < Ane 1—ce-1)(Ge-1 LS, v]) < 
< fs 1-ce-1)(9e-1 li (5, 0))) = Pee oS. -1) Ss 
S has 1—ce-1)(Gx-1Ui Y)). 

(2.3.2.A.2.2.2.1.2). s<k—-1. Then H,y* = H,[s,v] = 9,-,[8, 0] = 
= 9x-1y¥* < 9x-1y. By induction hypothesis g,-,y < g,-,[i, y]. On the 
other hand recalling that y # 0 we see that g,_,[i, y] A 2,41-q-—1) Which 
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together with the fact that Sg,_,[i, y] = n+1—(k—1) gives us by lemma 
B90) goile yl seats epee Cee 

(2.3.2.A.2.2.2.2). 7 <k—1. Then Rg,L), li, y]] = 94-1; (i, y]]. We now 
assert that H,[s, v] < 9,-,[s, v], forifs < k—1, then H,[s, v] = 9,_,[s, v], 
while if s5= k—1 we get 7 \s, v] => Ape 1) Gx-12) < 
S Ane 1 —(e-1)(Qn 41-1) FIn- 12) = 9,-1[k—1, v] = g,~,[s, v]. Next we 
get by two successive applications of the induction hypothesis: g,_ ,[s, v] < 
= Gay =< GF -1l ee-1w, tv IF 

This concludes the proof of our original assertion made at the outset of 
case 2.3.2 that Rg,[i, y] < Rg,L), [i, y]]. To complete the proof of our lem- 
ma it suffices to show that ue (K,41-,R9[i, ¥] U Ky41-1Q9;[i, y]) im- 
plies u < g,[), [i, y]]. We prefix the various cases in the proof of this by 
DoSrweArlbe 


(2.3.2.B.1). u < Q,1,-,. Then since Sg,[j, [i, y]] = 1+1—k we have by 
defs. 4.2 and 3.5 2,.,-.< 9,L/, [i yl. 

C3 2B 2) > Oe. 

(oe Be) we Ke, Og, lt, yo then? =k — 1 (otherwise v = @,,.,-;) 
so then Q9,[i,v] = 2,41-.+(ByY + 9.F iy) 80 UE K,41-492F,y* where 
y* is a component of y of the form [s,v] where s > k (ifs <k then 
9,.F,[s, v] = 0). Now Sg, F,[s, v] < n+1—k, sou = g,F,[s, v]. Using lem- 
mas 4.8(a) and 4.3(e) we see that g, F,[s, v] < 9,[s,v] < (0, 9;[s, v]). By lem- 
ma 4.3(a) together with the fact that vu > Q,4,-, we see S(0, g;[s, v]) = 
= n+1—k. Furthermore R(0, g;[s, v]) < Rg,L), [i, y]] so it would suffice 
to show that z é K,41-,g9;[5, v] implies z < g,[j, [i, y]]. If z = 1 this would 
be trivial, if z# 1 then recalling that [s, v] is a component of y we see that 
zé€ K,41:-,9;y so we can invoke lemma 4.13 and then lemma 3.10. 

(2.3.2.B.2.2). we K,41-,Rg9xLi, y]. Then by lemma 4.14 u < g,[j, [i, y]]. 


LemMaA 4.16. /f2<k < n+l, 1 <i<k, and 1<s<k, then 1<ze 
K,41-x9sli, x] implies z < g,y where y <, [i, x] and Ly < Lf{i, x]. 
Proor. By induction on w-: (L[i,x])+s. Our induction hypothesis is 
that w- (L[i', x’])+s’ < w-(L[i, x])+s implies that zé€ K,41-,9¢[i', x'] 
implies z < g,y where y <, [i’, x’] and Ly < L[i', x’] provided that 1 < 
<i’ <ik ama il < os < ike 

Cl) ese likeneg. (1, 0 (ge (rer, _,). 01S nal —k 
so z€ K,.1-s59s[i, x] implies ze K,+1-,9;-1[i, x]. It remains only to apply 
the induction hypothesis. 

Qe =s—1, Then 9-(7, 2.) —=8(12eH, x, QS, xeigeF x) eT hus 


394 H. LEVITZ E XXIV 


since by hypothesis z > 1, we get ze K,+,-49,l?, x] implies z € K,4,-,H,x 
or ZE Ky41-495 FX: 

(1). 26 K,.,-,H,x. then 22K, =, .% where x* 1s a compos 
of x of the form [j, w] where j < s—1. By lemmas 2.7 and 2.9(a) [j, w] <, 
<, U, Uj. w)] <, [4 x] and, of course, Lj, w] < L[i, x] so it would suf- 
fice to show that z < g,y where y <, [j, w] and Ly < L{j, w]; 

2117 = s—l, Then Axe g.lj, wlsince 7 <s— 1 =< ko we seimew 
induction hypothesis z < g,y where y <, [j, w] and Ly < L[j, w]. 

Q12)07 = sl) Then Ho ee Gee) 

(2.1.2.1). SH,x* < n-loke Then z = f,41-(5-1)(9.-1)- Since by 
hypothesis z > 1, we see that w # 0 and by lemma 4.3(a) we see that each 
non-zero component of w has index equal or greater than k. That is, w = 
= [2,,u,] 4 =... 4 la, uw.) whetet= Vand a2 x tor [r= 7) By lemme 
4-3(8) g.-114,, ie 9.1a,,u,] for ler < t. So z= bys —(5-1)(Gs-1W)S 
< Ays1-(s-1(9.¥) and by lemma 3.9¢h) ey eerie) 
< Miya GeW) < Aya —e(Qne1-e + Gn”) = GJulk, w]. Thus [k, w] plays the 
role of the y called for in the statement of this lemma, for by lemma 2.9(b) 
and lemma 2.7 we get [k, w] <, [s—1l, w] = [j, w] and L[k, w] < LI, w]. 

(2.1.2.2). SH,x* > n+1—k. Then ze K,4;-.9,-1w* where w* is a com- 
ponent of w. Since z > 1, w* has the form [a, u]: 

(2.1.2.2.1). a 2 k. Thenz = g,_,|a, w|iand by lemma 4:3(¢) g--le,2 

< g,{a, u] = g,w*. Let y = w* and note that w* <, [j, w*] <, Lj, w] and 
Lw* < LIj, w). 

(2.1.2.2.2). a<k. Then by induction hypothesis z< g,y where 
y <,[e, a) = w* =<, |7, vw) and Ly = 2 laren) sew = Iw) 

(2.2). 26 K,44-,9,f.%. When 2e keg. peo where <C" iseamcome 
ponent of x of the form [j,w] where j 2 s (if j <s, F,x* = 0). Since 
x* <,li, x*] <, [i, x], and Lx* = L]i xp at would sufiice to show jihae 

& gy where y<,x* and Ly < Lx: 

0.21). FLU.) = U,w). So 9,F.Li,] = sli, wI: 

(2.2.1.1). j = k. Then Sg,[j, w] < n+1—k so z = g,{j, w]. By lemma 
4.3(g) gsLi.w] < gelf, w] so let y = Ej, w] = x*. 

(2.2.1.2). j < k. Then by induction hypothesis there exists y such that 
Z<& g,v where y <, [J,w]—x* and Zy = Liz, 7). 

(2.2.2). F,[j, w] = w. For this to happenj = sand w = [t, uw] wheret < s. 
So z€ K,41-, 9st, u] and by induction hypothesis there exists y such that 
Z€g,y, ¥ <,[t, uj, and Ly < Liu]. But [t, uv] lis ul) = [rw Py 
lemma 2.7 and of course L[t, u] < L[j, w]. 
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(2.2.3). F,Lj, w] = [s, v] where v + 1 = w. For this to happen s = j. By 
induction hypothesis there exists y such that z < g,y and y <, [s, v] and 
Ly <= Lis,0|) But [s, oly, 0) eo, w) and Ls, v0] < 2), w]: 

Gyr > s=ISog st = =. 8ner gx). Prom nal —7 > 
>n+1—kandz> 1 followsze K,.,-,g;x* where x* is a component of x 
of the form [j, w]. Since [j, w] <, [i, x] and L[j, w] < L[i, x] it suffices 
to show that z < g,y where y <, [j, w]and Ly < L[j, w]: 

(3.1). j < k. From ze K,,,_,9;[j, w] the existence of the desired y fol- 
lows by the induction hypothesis. 

(3.2). j 2 k. Then z = g,[j, w] and by lemma 4.3(g) g;L/,w] < 9[, w] 
so let yp — (7, w]- 


Lema 4.17. If 2<k < n+l, then 1 < 26 Kys4—xling1—ce—-1)(Ge-1) im- 
plies that z < g,y : aa <, [k-1, w] and Ly < L[k—1, w]. 

PROOF. 

(1). Sha+1—ce-1)(Ge-1W) > n+1—k. Then z € K,41-49,-1w* where w* 
is a component of w of the form [?, uw]. Since [t, vu] = w* <, [kK—1, w] and 
L{t,u] < L[k—1, w] it would suffice to show that there exists y such 

that z <, g9,y where y <, [t, u] and Ly < L[t, u]: 

Chl eee Themeze—o,_,|¢, uv]. By lemma 4.3(¢) ¢,_,[f, us 9, \t, ¥| 
Sonlety —) [f, 1): 

(1.2). t < k. Then since z € K,41-49,—1[t, u] we have the existence of the 
desired y guaranteed to us by lemma 4.16. 

(2). Shas 1-(e-1)(Ge-1W) < n+1—k. Then z = hay y—ce—1)(Gu-1)- 
Then the same argument which is used in case 2.1.2.1 of lemma 4.16 can be 
employed word for word (writing & for the s used there) to show that 
z <g,{k,w]. So let y = [k,w] and note that [k, w] <,[k—1,w] and 
L{k, w] < L[k-1, w]. 


THEOREM 4.18 (Order preservation). Jf 0<k <n+l, thenu<,v in P*(n+1) 
implies g,u < g,v in Z*(n). 

ProoF. Our proof is by induction on w - (Lw+Lv)+k. If u or v has more 
than one non-zero component we can apply induction hypothesis immediate- 
ly, while if « = 0 our conclusion is trivial. So we need only consider the case 
that uw has the form [i, x] and v has the form [j, y]. 

(1). &<i=j. Then by lemma 2.9(a) x <;y, so by induction hypo- 
thesis g;x< g;y. Thus using lemma 3.9(a) 9, [i,x] = Myo 1—1(Qn41-i + 9X) < 
< Migs —e(Qaa1-it GY) = IL; YI. 

(2). i#j vi<k. By lemma 2.8 we have three cases to consider: 
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(2.1). k = OAV <iv(j = iax <;y)). By virtue of the basic assump- 
tion of case 2 we see that k = Oaj < i, but then Sgo[i, x] = n4+-1—i<n+ 
+1—j = Sgo[j, y] by lemma 4.3(a), so gofi, x] < 9oLj, y] by definition 3.5. 

(2.2). k > Oaj < ka [i, x] <, y. Then by induction hypothesis g,[i, x]< 

< g,y while 9,» < 9,[j, y) by lemma 4.15. 

(2.3). k > OA([E x] <pan Lp AG > kV < kax <, [, y)))P By 
the induction hypothesis g,_,[i, x] < 9,-,[, yl]: 

(2.3.1). i > k. Then by virtue of the basic assumption of case 2 we 
get i # j, while from lemma 2.9(b) we see that j > i is impossible. Thus 
j <i, but then using lemma 4.3(a) we see that Sg,[i, x] =n+1l—i< 
<n+1—max{k,j} = Sg[j, y], 80 li, x) < gL), y] by def. 3.5. 

(2.3.2). i< kKAx <,[j, y]. From lemma 2.9(b) we see that it cannot be 
the case thati < k <j, thusi < kaj < k. On the other handi =kaj=k 
is impossible by virtue of the basic assumption of case 2. Furthermore using 
lemma 2.9(b) together with the previous assumption that [i, x] <,-, Lj, y] 
we see also that i < kKAj = k is also impossible. Thus we have only to con- 
sider the two casesi = kaj < kandi< kaj <k: 

(2.3.21).i=kay<k Them 9,\e x)= 44238, er oe ee om 
x <, [j, y] we get by induction hypothesis that g,x < g,[/, y]. Furthermore, 
from def. 4.2 we see that 9, [j, y] = (Z, 2,41-,+w) for some z > 1 and some 
w 2 0. Thus using lemma 3.10 we see that 2,4;-,+9,x < 9,[j, y] and then 
using lemma 3.9(d) we see that A, 44 —-,(Qu41—-~+94%) < %& Lj, ¥).- 

(2.3.2.2). i< kaj <k. We now assert and will show that 
zéK,+1,-,Rg9,[i, x] implies z < g,[j, y]. We prefix the various cases which 
arise in the proof of this assertion by 2.3.2.2.A. We can assume z > 1 for 
otherwise our assertion would be trivial: 

(2.3.2.2.A.1). i< k—1. Then Ro,[?, x)= 9,-i[4 x) By lemmar4ale 
26 K,44-19.-11i,X] implies 2 < g,w for some w <, [z,-7) wherew we. 
< L[i, x]. Now by induction hypothesis g,w < g,[j, y], so Zz < 9,[j, yl. 

(2.3.2.2.A.2), i= k—1. Then Ro,fi, x) =1+H,x so ze K,,. ae 
where x* is a component of x. If H,x* = 0 our assertion would be trivial, 
SO We can assume that x* = [t, p] for t < k and p some ordinal diagram: 

(2.3.2.2.A.2.1). << k—1. Then H,x* = g,-,[t, p] so we need only ap- 
ply lemma 4.16 and induction hypothesis as we did in case 2.3.2.2.A.1. 

(2.3.2.2.A.2.2). f= k—-1. Then Ay x* = hys—q—1)(Ge-1p). By lemma 
4.17 z < g,w where w <, [kK—1, p] and Lw < L[k—1, p]. But [k—1, p] = 
= x* <, [k—1, x*] <, [i, x] and L[k—1, p] < L[i, x] so by induction 
hypothesis g,w < g,[j, y], thus z < g,[j, y]. 
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This completes the proof of the assertion made at the outset of case 2.3.2.2. 
It remains to show that Rg,[i,x] < Rg,Uj, y]A Qg,[i, x] < 9,[j, y] or 
Rg, [i, x] = Rg, (j, y] A Og, [i, x] < O9,[j, y]. We prefix the cases which 
arise in showing this by 2.3.2.2.B: 

(2322.81). 1=k—1, Then QOg,[), a) 2B «ee ofF, xe We 
first note by lemma 4.8(a) that g,F,x < g,x, but x <, [i, x] by lemma 2.7 
So 9,x < g,[j, y] by induction hypothesis. Using lemma 3.10 we see that 
Q,4+41-x < %&L, y]. Noting further that B,x < 1 we get 2,41:-,+(B.x + 
tt FX) < IL, VI: 

(2.3.2.2.B.1.1). 7 <k—1. Then Rg,[j, y] = 9-1[j, y]. Now each com- 
ponent of Rg,[i, x] is of the form H,x* where x* is a connected component 
of x. If H,x* = 0 then trivially H,x* < Rg,[j, y] so we can assume that 
x* = [t, z] where ¢t < k and z is some ordinal diagram: 

e228. It —k—1. Uhen AH,x =f, 1 -@-1(%:-12). Now 
ee ete ae ee 1 |, x] <, Se )\so by induction hypothesis 
9x—-12 < Jx-i1Lj, y] and by lemma 3.9(d) hy41—(e—1)(Ge-12) < Ge-1 14, VI. 

52 eB le a | hee He et, 2 Nows(t, 2] <.<4 
[i, x] <,—, Lj, y] so by induction hypothesis g,_,[t, z] < g,-,U, yl: 

(e223 2) ye ole Promeli, x) =p= ly, y] follows x <,.,y by 
lemma 2.9(a). Write x = x, + x2 where each non-zero component of x, 
has index equal or greater than k an each non-zero component of x, has 
index less than k. In a similar fashion write y = y, + y,. Nowx <,_,y im- 
plies (x2 = ¥2A%y <p-1¥1) V (%2 <y-1 2): 

(a 22 Bel ee yy 2) Phen by def: 42: Rg,li x) = 
= 14+AH,x = 1+H,x, = 14+AH,y2 = 14+A,y = Ro,[j, y]. We now show 
that Qg,[i, x] < Qg,[j, y]. From lemma 2.13(c) and def. 2.12 we see that 
oo eet yy — 1, ), linus by induction hypothesis 9, f,x =99,F_y. 
Now recalling that B, has for its range the values 0 and 1 we assert and will 
show that Bx + 9,F,x < B,y + 9,F,y; from this Qg,[i, x] < Qg9,[j, y] 
would readily follow: 

(2:3.2.25B 1:2 lel ) GB, al A Bal) V (8, x = 0). Then trivial. 

(2.3.2.2.B.1.2.1.2.) B.x = 1AB,y = 0. If our assertion did not hold we 
would have B,x + 9,F,x = B,y + 9,F,y. But then by def. 4.1 x would be 
of the form [s, z] + /, for some s < kK—1, some z, and some ¢ 2 0. This 
would mean that g,F,y =/,,,. Using the fact that x, = y, together 
with lemma 4.7(c) we would have that y = [s,z] + 7,,,. But then by 
def. 4.1 B,y = 1 which is contrary to our assumption. 

(2.3.2.2.B.1.2.2). x2 <,~1 ¥2- We now assert and will show that H,x. < 
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< H,y, from this would follow Rg,[i, x]= 1+AH,x=1+H,x,<1+A,y,= 
= Rg,[J, y]. It is sufficient to consider the case where x, and y, are connected: 

(2.3.2.2.B.1.2.2.1). x, = [k—1, 2] and y, = [k—1, w] for some z and 
some w. Now z <,;_,W SO Q-,Z < g,-,w by induction hypothesis. 
Using def. 4.2 and lemma 3.9(a) we get Ayx2 = hyai1—(e—1)(Ge-12) < 
< hin —-1)(9e-1 ¥) = Ay 2- 

(2.3.2.2.B.1.2.2.2). x, = [k—1, z] and y5 = [#, w] for some z, somemy: 
and some t < k—1. Then from z <,_, [k—1, z] <,-, [t, w] it follows by 
the induction hypothesis that g,_,z < 9,-,[t, w], so using lemma 3.9(d) 
with def. 4.2 we see that Hy x2 = hys1—c—1)(Ge-12) < Gx-alt, W] = Ayy2- 

(2.3.2.2.B.1.2.2.3). x, = [t,z]eand 5 —"[s, wi) for some Zz, wane 
some t, s < k—1. Then using induction hypothesis we have that H,x, = 
= 9,-11t,2] < gx-1[8, w] = Ayo. 

(2.3.2.2.B.1.2.2.4). x, = [t, z]Jandy, = [k—1, w] forsome z, wand some 
t<k-—1. Fromt < k—1 follows y, <,_,x,, so by lemma 2.8 we see that 
X2 <,-,w. If x, <,_,w then by induction hypothesis g,—,x2< 9,-1 
so we have, using lemma 3.9(d), that Ayx2 = gy-1%2 < Gr-1W < 
< ho41-c@-1)(Ge-1”) = H,y2. On the other hand if x, = w then using 
the fact that t < k—1 along with lemma 4.6(a) we get Hx. = 9,-,[t, z]< 
< Ags —e-1)(Ge-1 lt, Z]) = (Peer sy) Che w) = A,[k—1, w] = A,y2. 

(2.3.2.2.B.2). 1< k—1. Then Og), x= Game, o,1))y ebvalemma 
3.10. Furthermore Rg,[i, x] = g,-,[i, x]: 

(2.3.2.2.B.2.1). j < kK—1. Then frome[7, x] <,2, [j)y] it followsms, 
induction that g,—,[i, x] < 9-11, ¥] = Raoul, v1. 

(2.3.2.2.B.2.2). j =k—1. From i < k—1 follows [j, y] <,-, [i, x] so 
by lemma 2.8 [i, x] <,-, Lj, y] can only happen by virtue of having 
peel] eri ie 

(2:3:2.2.B.2:2:1). [Fx] =9e Thens,7 = I) seg, li, <] — 
= (1+9,-1[%, x], Q,41-,) while gL, y] = (+ 9n-1 [i x], Qn41—% # 1). 

(2:3,2.2.B.2.2 2) G2 lene 

(2:3:2.2.B.2:2.2.1 )2ye— |i, 3b where z = 0) Them aia — 
= (1494-117, x], Qn41-%) while 9, Lj, ¥] = (1+ 9.-1 [i x] # 
4h AZ, Q,4,-.+(B, (i, x] + z) 4 g, F,z)). But H,z #0 or 9,727 4 0, 
in either case we are finished. 

(2.3.2.2.B.2.2.2.2). [i, x] <,_,y* where y* is a component of y. Then y* 
must have index less than k: 

(2.3.2.2.B.2.2.2.2.1). y* has index less than k—1. By induction hypothesis 
9x-11i, X]<Gx-1¥*, SO O-s[t, X]< 9-1 ¥* = Hyy* < 14+ Ay = RGU, v1. 
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(2.3.2.2.B.2.2.2.2.2). y* = [k—1,z] some z. Now i < k—1 means that 
We lal 257, x] Soto have [i x|l<,2, "> we must have [7, x] <,2) 2. 
liv, x] <,2)2 then by induction g,-\ (7, x] < g,-,2 so by lemma 
3.9(€) 9—sfi, x) < 9p-12 < Mas 1—ce-1)(Ge-12) = Aylk—1, 2) = Ayy* < 
< 1+H,y = Rg,{j, y]. While if [i,x] =z we have using lemma 4.6(a) 
Gu— 118, X) < Pigg 1-1) (Gu-1 8 4) = Ane te 1)(Ge-12) = Ae(k—-1, 2] = 
= My* < 1+H,y = Rg,[), y). 


DEFINITION 4.19. We define D(n: k) for 0 < k < nas follows: 


eee | (1, Oi if ea, 
(ca) A RRR ANE ae 


It is easy to see that SD(n: k) = k from which it follows that 7 <j implies 
D(n: i) < D(n: j). 


THEOREM 4.20. (Closure). Jf0 < k < n+l, then g,x < D(n: Sg,x). 

Proor. By induction on w:ZLx+k. Our induction hypothesis is that 
Ge x’ < D(n: Sgyx') whenever w@-Lx'+k' <@:-Lx+k. We must show 
under this assumption that g,x < D(n: Sg,x): 

(1). x = 0. Then trivial. 

(2). x has more than one non-zero component. Let these components 
poe eee lon? = 2) By induction hypothesis gx, — D(a: Sg, ~x,) < 
Samar) (nSg,%,) — D(n: max; Sg,x;) = D(n: Sg, x). 

(3). x has exactly one non-zero component. Then x has the form {i, y] for 
some y: 

(Cale ee Dhemigec 91, yl 2, + iy). Using lemma 
4.3(a) we see that Q,.,-; < D(n:n+1-—i) = D(n: Sg,x). Furthermore by 
induction hypothesis along with lemma 4.3(a) we get g;y < D(n: Sg;y) < 
< D(n:n+1—i). Thus 2,4,;-;+9;y < D(n: n+1 —i) and by lemma 3.9(d) 
SO iS Ags —4(Qn41-1+9:). 

G2)eiee kool hen 9.1.) ea eee ee). By induction 
hypothesis g,_,[i, x] < D(n: Sg,-1{i, y]) = D(nt:n+1—(k—-1)) = 
RD(n: Sg,{i, y]). Thus Rg,{i, y) < RD(n: Sg,{i, y]). Also, Qg,{i, y] = 
= Q,41-~< D(n: Sg,{i,y]). It remains only to show that z¢K,4,-;Rg;[i, y] 
implies z < D(n: Sg,[i, x]). Let such a z be given then z € K,41-49x-1[, y]. 
But then by lemma 4.16 z < g,w where Lw < Lfi, y]. By induction hy- 
pothesis g,w < D(n: Sg,w) < D(n:n+1—k) = D(n: Sg,{i, y]), so z< 
< D(n: S9xli, y)). 

(3.3). i= k—1 then g,{i, y] = I+ALY, Que1-n+ (Buy 9 Fuy))- We 
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first note that LF, y < Ly < L[i, y] so by induction g,F,y < D(n: Sg, A.V) < 
< D(n:nt+1—k) = D(n: Sg,{i, y]). From this it easily follows that 
09,[i, y] < D(n: Sg,[i, y]). Using lemmas 4.16, 4.17, and induction hy- 
pothesis we show in a fashion similar to case 3.2 that zé€ K,,,-,Rg9,[i, y] 
implies z < D(n: Sg,[i,y]). It remains to show that Rg,[i,y] < 
< RD(n: Sg,[i, y]), but this is also handled in a fashion similar to case 3.2. 


COROLLARY 4.21 (Closure). 9,41x < D(n: 0). 


LEMMA 4.22. (x,y) < D(n:k)ay > Q, implies x < RD(n: k). 

PROOF. 

(1). x = 0. Then trivial since RD(n: k) is never zero. 

(2). x # 0. Suppose to the contrary that RD(n: k) < x: 

(2.1). RD(n:k) = x. Then since QD(n: k) = Q, < y it would follow 
that D(n:k) < (x, y), contradiction. 

(2.2). RD(n: k) < x. It is easy to show that ze K,RD(n: k) implies 
z<1. But 1 < (x,y) since x #0. Moreover Q, < y < (x,y). Thus 
D(n: k) < (x, y), contradiction. 


DEFINITION 4.23. To each term wu of X(n) we define recursively the notion of | 
subterm of u and the notion of depth of a subterm of u in u. Au will denote 
the set of subterms of wu. 

(1). Ifw is Q; then Au = {0,u}. These subterms will have depth 0 in w. 

(2). If uw is (x,y) then Au= {u}U AxU Ay. u will have depth 0 in uw. 
While if z has depth iin x or y, then z has depth i+1 in u. 

(3). If u has components v,0,5°..u,(r = 2) then Ay — |S Ae 
z has depth 7in u,, then z has depth 7 in w. 


Note that u = v implies Au = Av. 


THEOREM 4.24. If v < D(n: Sv) whenever v is a subterm of u, then there 
exists a w such that u = 9,+1~s,(W)- 

ProoFr. By induction on Lu. Denote Su by k: 

()) z= © Then taviall 

(2). «has more than one non-zero component. Let u* be such a component 
and denote Su* by k*, so k* < k. By induction hypothesis u* = g,+1—,+(x*) 
for some x*. By lemma 4.3(d) u* = g,,,-,2* for some z*. Thus u = 9,41-,2 
for some z. 

(3). uw has exactly one non-zero component: 

(3.1). u = Q,; for some j 4 0. Then uv = g,4,-;[n+1—J, 0]. 

(3.2). uw = (x,y) for some x and some y. Since Sy = k we can write 
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y = Q,+1t for some t. Moreover every subterm of ¢ is a subterm of u, so by 
induction hypothesis we can write tf = g,,-,v for some v: 

(3.2.1). x = 0. Then u = (0, y) = (0, 2,4+-9,+41-,v). Using lemma 3.9(b) 
two cases arise: 

(G21.1)) 0, OF ge!) — 2 0--9.4,-.0). Then u= 
= Jn+1-n[n+1—k, v]. 

(3.2.12) (0, Gage) = yO, gee,(v) 4 1). Then a = 
= 9n+1-4[0+1—k, v + 1). 

(Oval eee UWS 

(3.2.2.1). x = (p, g) where p #0 and Sx = k+1. Then by induction hy- 
pothesis x = 9,41-(,+1)(W) for some w, and we see by lemma 4.3 that w = 
= [i, z] for some z where i < n+1—(k+1): 

G22 et — 0) henge — 9, 4- 10.21: 

Cee) tor some vin 0, Lhenw— 9.4, 2,(0—Kk, 1,4 [t, 2)|: 

(3.2.2.1.3). otherwise. As seen previously t = g,+,-,v for some v, but by 
leone. 1S (aj eee eee Dicrerare’— Gree (hci oy( Jue ee) 
thus uw = 9,41-,4l9—k,fr+1-x¥ # (i, 2]. 

(222) axel Deni — Gee, i gee le: 

(3.2.2.3). otherwise. So x = 1+r for some r. Since u = (x, y) is a sub- 
term of wu by hypothesis, we have that (x, y) < D(n: S(x, y)) = D(n: Sy) = 
= D(n:k). Using y > Qs, = Q, we see from lemma 4.22 that x < RD(n: k). 
Now k # n for otherwise RD(n: k) = 1 and x = 0 contrary to the basic 
assumption of case 3.2.2. Since k An, RD(n:k) = D(n:k+1), so x < 
< D(n:k+1) from which it follows that Sx < k+1. Now by lemma 3.9(c) 
we see that it is possible to write r = p + q where each non-zero compo- 
nent q; of q can be written in the form g; = (4a;, b;) for some a; # 0 and some 
b; such that Sq; = k+1, and each non-zero component p; of p can be 
written p; = h,+,2; for some 2Z;: 

(3.2.2.3.1). p = 0. Then q has more than one non-zero component (other- 
wise we would contradict the basic assumption of case 3.2.2.3.) call them 
91>92>--+9-(c = 2). By induction hypothesis we have that for1 <i < c, 
9i = Gn-,W; for some w;, and by lemma 4.3(c) w; = [j;, 5;] for some s; and 
somej,; < n—k. Thusu = g,41-412—K,fnai—nv # Li. 5] +... Ye Se]]- 

(3.2.2.3.2). p # Oandq # 0. Then p has components p, ,p2,..-, Pg (d 21) 
and q has components q,, q2,..-,4-(¢ > 1). As in the previous case g; = 
= g,-.W; for 1 < i < c where w, = [j;, 5;] with j; < n—k. As previously 
noted p; = h,4,2;for 1 < i < d. By induction hypothesis z; = g,+4~-s-,(€;) 
for some e;. But as we have already seen Sx < k+1 so Sz; = Sp; < Sx < 
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<k+1. This gives us the right to employ lemma 4.3(d) getting 
Z; = Gn-(é;) for some e, a Thus 
ne 
Fnti-nv # [n—k, e:] # ... + [n—k, eg] # Li. 5a] #.-. # Lie, Sel]. 
(3.2.2.3.3). p= U0 and q@ = Then py has components fp), 72>. 0es 
(d > 1) and as in the previous case we see p; = /y419,-,€; for some e; so 
u = [nok fee + ln — lee eee in Ser 


LemMa 4.25. Let {a,} be a sequence of non-negative integers such that 
ay = Oand for some j, aj, < a;, and a,,,; <a,+1 for alls such that0 < 
< 8 <j, then there exists ani where 0 < i<j such that aj;4,= a; and dj4,< 
<a, for alls such thati<s <j. 

Proor. We first note that 7 > 0, otherwise 0 = dy = a; > @;4, which 
violates the assumption that all terms of the sequence are non-negative. Let 
N= {tlO0<t<jaa, <a,,,}. Froom0<0<jf and a =0< ayy, it 
follows that 0€ N so Nis non-empty. Let 7 be the greatest member of NV. We 
now show that 7 has the properties ascribed to it in the statement of this 
lemma. From i <j we geti+ 1 </j: 

(1). i+ 1 =j. Thena,,, <u; = aj4, < a;+1,s04;4, < a;. Onthe other 
hand from the definition of N we see that a; < a;41, 80 a; =a;4,. Further- 
more i < s < j = i+] implies that s =j so a, = a; > Qj44. 

(2). i+1 <j. We now assert that a; = 4,41, for if a; < a;4, it would 
follow that a;4, < a;+1 < a;4,, but then (+1) €N, and this would con- 
tradict the fact that zis the greatest member of N. It remains only to show that 
i< s <j implies a, > aj41: 

(Qi) s — 7. Then aa a 

(2.2). i<s <j. Then if a, < aj4, we would have s¢ N which contra- 
dicts the fact that i is the greatest member of N. 


Lema 4.26. [fu < D(n: 0), then for every subterm u* of u: 
(a): ute Din: Sut 
(b). u* = (x,y) implies Sx, Sy < Su*+1. 

Proor. We prove the conjunction of (a) and (b) by induction on the depth 
ofu* inu. We first note that (b) follows from (a) since first of all Sy = Su* < 
< Su* +1, moreover from (x,y) < D(n: Su*) and y > Qs, = Qs,» it 
follows from lemma 4.22 that x < RD(n: Su*), thus Sx < SRD(n: Su*) < 
< Su*+1. 

We now show that (a) holds under the assumption that (a) and (b) both 
hold for subterms of smaller depth: 
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(1). u* has depth 0 in wu. Then u* is a component of u so Su* = 0 and 
eau =< Din. 0) = Da Su). 

(2). u* has depth k+1 in uw. Then there exists a subterm z = (x, y) of 
depth k in u such that u* is a component of x or u* is a component of y: 

(2.1). Su* = Sz. Using lemma 3.10 we see that u* < z, But by induction 
hypothesis z < D(n: Sz).= D(n: Su*). 

(2.2). Su* > Sz. Then u* is a component of x, so u* < x. Since by in- 
duction hypothesis (x, y) < D(n: Sz) we see using lemma 4.22 together 
with the fact that y > Qs, = Qs, that x < RD(n: Sz): 

G21). Sz=ay Then RD(n: $z)] 1 < D(meSu*). 

(2.2.2). Sz <n. Then RD(n: Sz) = D(n: Sz+1) < D(n: Su*). 

(2.3). Su* < Sz. Since u* has depth k+1 in u, we can find a sequence 
Up, Uy,++-> Mp4, Of subterms of u such that up is a component of u, u, = Z, 
U4, = u*, and for each s such that 0 < s < k the depth of u,,, in u equals 
the depth of uw, in uw plus one. By the induction hypothesis Su, < Su,+1 for 
0 <t< _k, so by lemma 4.25 there exists an i such that 0 < i < k and such 
that Su, = Su,,, and such that Su,,, < Su, whenever i < t < k. Thus 
Un41 € (Ks, Ru; U Ks,,Qu;) thus it follows by lemma 3.10 that u* = 441, < 
<u;. But by induction hypothesis wu; < D(n: Su;) = D(n: Stuy41) = 
= Din: Su*). 


Remark. The above lemma does not generalize by replacing the 0 by an 
arbitrary positive integer. To see this note that in the system 2*(2) we get 
(Q,,0) < D(2: 1) but it is not true that (Q,,0) < D(2: 0). 


THEOREM 4.27 (Onto-ness). Jf u < D(n:0), then u = g,4,w for some w. 
PRooF. Immediate consequence of lemma 4.26 and theorem 4.24. 


Remark. The obvious generalization of the above theorem would be that 
u < D(n: k) implies u = g,.,-,(w) for some w. This proves to be false as 
can be seen by the following considerations: in the system 2*(2) if we let 
u = (23,0), then u < D(2:1). On the other hand we see from def. 3.5 and 
lemma 4.3(a) that if (x, y) is an image under some g;,, then Sx < Sy+1. 


THEOREM 4.28. ||(P*(n+1), <,41>[| = ||D(m: 0) € 2*(n)|I. 

Proor. By corollary 4.21 g,,, is a mapping from P*(n+1) into 
{x|x € Z*(n) Ax < D(n:0)}. By theorem 4.18 it is order preserving. By 
theorem 4.27 it is onto. 


Remark. If n > 1, then the set of images under g; does not form an initial 
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segment when 0 <i<n+1. To see this note that 2,,,-; = g,{i, 0] 
is an image, (Q,+2-;, 0) < Qu+1-;, but (Q,+2-;,0) is not an image since 
SQ,42-; > SO+1. 


5. The main conclusions. In the previous section we showed an order pre- 
serving mapping from <P*(n+1), <,+1> onto an initial segment of <2*(n), 
<)>. We now show that ¢<O(n+1), <,41> can be mapped onto an initial 
segment of <2(n), <>. To do this we first remind the reader of the author’s 
result (cf. § 2) that |}(O*(n+1), <,41>1 =I/[1, [1, 1]] e <P*(#+2), <, 42511: 
Moreover it is only a matter of calculation to see that under the order pre- 
serving mapping 9,42 from <P*(n+2), <,4 > onto an initial segment of 
<2*(n+1), <> the ordinal diagram [1, [1,1]] is carried onto the term 
u, = ((.-- (((OM1), 2); O24), -.-); G5)) New mie the vorder presen 
mapping from <2*(n+1), <> onto <2(n+1), <> (cf. § 3) u, is mapped onto 
the term v, =9((-2(, 1)) 2, ) BORE p ie). 1). Se recalling (62) 
[<O(n+1), <p4 ill = IKO*(M+1, <pas>Il we have that ||(O(n+1), 
<,+n 1 = llv,€<2 (+1), <)>||. Again it is only a matter of calculation 
to see that v, < Q[n+1, O]'. According to ([11], theorem 6), v,¢2(n) and 
lv, €<2(n +1), <>|| = |lv, € <2(7), <>||. This concludes our argument 
that O(n +1), <,,44> is order isomorphic to an initial segment of (2(n), <)>. 

By calculation one can see that Q[n, 0] < v, and we have already seen 
that v, < Q[n+1, 0], so 


lim ||<O(m), <,,>|| = lim ||, € <2(m), <>|| = lim ||Q[m, 0] € <2(m), <)|I. 
m<o m<o m<o 

This latter limit was shown ([11], theorem 9) to be the order type of the lar- 
gest well ordered initial segment of | ),< 2(m). 
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SECTION F 


PROOF THEORY 


ON THE ORIGINAL GENTZEN CONSISTENCY PROOF 
FOR NUMBER THEORY * 


PAUL BERNAYS 


The first published Gentzen consistency proof for the formal system of first 
order number theory, including standard logic, the Peano axioms and 
recursive definitions, was given in Gentzen’s paper ‘Die Widerspruchsfreiheit 
der reinen Zahlentheorie’ (Math. Ann. 112 (1936)). It was however not his 
original proof but a revised version of it. The revision was motivated by 
a criticism, in which I myself for some time concurred of the original proof 
on the grounds that it implicitly included an application of the fan theorem. 
Gentzen did not expressly oppose this opinion; he took care of the criticism 
by modifying his consistency proof before it was published. Fortunately 
the text of the original proof is preserved in galley proof*. 

On rereading the original text, I came to doubt the mentioned opiniont. 
It seems worth while reconsidering the original proof, because 

1. it is certainly easier to follow than the first published proof and at 
least as easy to follow as the second Gentzen consistency proof, 

2. it does not require the generalized form of induction (ordinal induction 
up to &). 

I shall give in this paper a description of Gentzen’s original proof in a 
way sufficiently detailed to make apparent the kind of constructive methods 
here used. At the same time I shall make a slight simplification in this proof, 
making use of some remarks in Gentzen’s Annalen proof. 

I begin by describing the formal system to be considered. The logical 
calculus used is the calculus of sequents* I > A where I, the antecedent, 
is a finite (possibly empty) list of formulas separated by commas, and 


* Since Prof. Bernays was unfortunately unable to be present, this paper was read by 
William Howard. — Eds. 

* An English translation ofit willsoon appear in an edition of Gentzen’s works in English 
by Manfred Szabo. 

t Georg Kreisel had previously stated in conversation that he dissented from this opinion. 
+ ‘Sequent’ as a technical term should be distinguished from ‘sequence’ in its usual sense. 
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where A, the succedent, is a single formula. The logical symbols v (or), 
4 (existence) and > (implication) are eliminated, and only & (and), — 
(not), V (universality) are used. 

As Gentzen admits arithmetical function symbols (with the restriction 
that for numerical arguments the value of the function must be computable), 
there is no loss of generality in assuming that all prime formulas are equa- 
tions between terms. (Terms are built up in the usual way from free number 
variables*, numerals and function symbols.) 

The rules of the calculus, after the mentioned elimination, are the 
following: 

Initial sequents are: 

(1). Logical initial sequents, i.e. sequents of one of the forms 


A&B-A, A&B-B, A,B-~A &B, 
A, =A > 1 = 2, —4—4A--A, 
(Vx)F(x) > F(t), 


where x is a bound variable, and t a term. 
(2). Arithmetic initial sequents, i.e. sequents whose formulas are equa- 
tions and which have the property that by replacing each free variable by a 
numeral (of course equal variables by equal numerals) and by computing 
the function values, either the succedent formula gets the truth value ‘true’ 
or one of the antecedent formulas gets the truth value ‘false’, according to 
the usual valuation of numerical equations. 
The rules of inference are: 
(a). Rules of structural change in a sequent, permitting one 
(a,). to interchange the order of the formulas of the antecedent, 
(a,). to add an arbitrary formula to the antecedent, 
(a3). to delete a repetition of a formula in the antecedent, 
(a,). to change a bound variable of a universal quantifier, everywhere 
in its scope, into another bound variable. 
(b). Logical inference schemata 


To-A A,A>B 
T,A-B 


(cut) 


* Gentzen makes a notational distinction between free and bound variables. 

+ This schema does not occur in the original text as a fundamental schema, but it is deriva- 
ble from the schemata for negation there stated, which Gentzen later changed in § 14 of 
the Annalen paper. 
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IC = =) A 


T- F(a) 
T = (Vx)F(x) 


(negation introduction) 


(where a is a free variable not occur- 
ring in any formula of I’ nor in F(x), 
and where x is a bound variable) 
(universality introduction) 

(c). Induction 


T+ F(t) F(a), A > F(at+1) 
lr, A > F(t) 


where t is a term and a a free variable not occurring in T, A, F(1) 
or F(t). 


In this calculus from any two formulas A, — A we can derive, using the 
initial sequent A, — A> 1 = 2 and cut, the sequent > 1 = 2; hence, 
in order to prove consistency of the considered formal system, it is sufficient 
to show that the sequent > 1 = 2 is not derivable. 
A concept of ‘reduction’ of a sequent is introduced. The following are the 
possible ‘reduction steps’ on a sequent: 
(a,). Replacing a free variable, wherever it occurs in the sequent, by the 
the same numeral, which can be arbitrarily chosen. 
(a2). Replacing a function symbol all of whose arguments are constants 
by its value. 
(B,). When the succedent has the form (Vx)F(x), replacing it by F(k), 
where k is an arbitrarily chosen numeral. 
(B,). When the succedent has the form A & B, replacing it by A or by B, 
according to an arbitrary choice. 
(B3). When the sequent has the form [+ —A, replacing it by 
A,T 7-1 = 2. 
(y). When the succedent is a false numerical equation: 
(y1). Replacing an antecedent formula (Vx)F(x) by F(k), or adding 
F(k) to it in the antecedent, where k is a numeral. 
(y2). Replacing an antecedent formula A & B by one of the formulas 
A, B or adding one of these in the antecedent. 
(y3). If an antecedent formula — A occurs, replacing the succedent 
formula by A and possibly cancelling the formula — A in the 
antecedent. 
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The description of these reduction steps must be completed by the following 
rule of preference: The step (a,) has preference over all other reduction 
steps, and (a,), (#2) have preference over the other reduction steps. That 
means: the steps (a) are admitted only for a sequent which contains no 
free variable, and the steps (B,), (62), (Bs), (71), (¥2), (v3) are admitted 
only for sequents which contain neither a free variable nor a term which 
is not a numeral. 

Related to the concept of a reduction step is that of a ‘reduction process’. 
(Reduziervorschrift). By a reduction process for a sequent is meant a 
procedure consisting of a terminating sequence of successive reduction 
steps by which the sequent is brought into a ‘final form’ (Endform), i.e. 
to a sequent satisfying at least one of the two conditions: (1) that the succe- 
dent is a true numerical equation, or (2) that some antecedent formula is a 
false numerical equation. This defining property of a reduction process is to 
be understood in the strong sense that, whenever reduction steps of the 
kinds (a1), (B1), (Bz) occur in the procedure, the final form will be attained 
for every decision on the arbitrary choices, by suitably making choices in 
connection with steps of the kind (71), (v2), (73). 

A consequence of the rule of preference is that whenever for a sequent S 
there is a reduction process, there is a reduction process for any sequent 
resulting from S by replacing some or all of the free variables of S by 
numerals. 

As an illustration of a reduction process may serve the sequents of the 
form A> A. 

First, by applications of (a,) the free variables are removed. In the 
resulting sequent A* > A*, by applications of (B,), (B2) and (a) the suc- 
cedent is brought into either the form of an equation r=s where r and 
s are numerals, or into the form — C, each term in C being a numeral. 

In the first case the equation r = sis a numerical equation. If this equation 
is true (i.e. of the form m = m) then a final form has been reached. Other- 
Wise we have a sequent A*¥ — m = n with m = n false. Now we can apply 
the reduction steps (a2), (v1), (¥2) to the antecedent A*; and this can be 
done in an exactly parallel fashion to the reduction steps («2), (B,), (B2) 
previously applied to the succedent A*. In this way the antecedent turns 
into an equation m = n, so that we get a false numerical equation in the 
antecedent, and so the sequent is again brought into a final form. 

In the second case, where we have, after the first reduction steps, the 
sequent A* + — C, we get, by applying (3), the sequent C, A* > 1 = 2. 
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Applying here again to the antecedent formula A* the reduction steps 
(y1), (Y2), (#2) in an exactly parallel fashion to the steps (B,), (B2), (a2) 
previously applied to the succedent A*, we obtain the sequent 
C, =~ C- | = 2, then by applying (y3) we obtain C > C. Here the formula 
C contains at least one logical symbol less than A*. Proceeding with the 
sequent C — C in the same way as we did before with A* > A*, we either 
obtain a final form or we get a sequent D > D, where D contains fewer 
logical symbols than C. Thus, continuing in the same way, we come, if we 
have not already obtained a final form, to a sequent Q > Q, where Q is a 
numerical equation m = n. The sequent m = n > m = n has in any case 
a final form. 

It is to be observed that the course of the reduction following the given 
process depends on the choices to be made at the reduction steps («,), 
(B,), (B2), which are arbitrary. The succession of these choices constitutes 
a free choice sequence in Brouwer’s sense*. And the given reduction process 
consists in assigning to any such free choice sequence and to any sequent 
A — A a Succession of sequents, which in a finitist way can be seen to end 
with a sequent in final form. 

Using the method of the reduction process for sequents A > A, and also 
that process itself, reduction processes can be given for each of the logical 
initial sequents. The arithmetical initial sequents can be brought into final 
form by reduction steps («,), (a2). 

If it can be shown that for every derivable sequent there exists a reduction 
process, then it follows that the formal system under consideration is 
consistent. For there is no reduction process for the sequent > | = 2. 
Hence, in order to prove consistency, it is sufficient to show for each of the 
inference rules, that if we have a reduction process for the premise, or in 
the many-premise case for each of the premises, we get a reduction process 
for the conclusion. Let us try to state this for the various rules of inference. 

For the rules (a,), (a2), (a4) the statement is trivial. For (a3) the statement 
follows from the fact that in the reduction steps (71), (¥2), (73) one is 
allowed to leave unchanged the antecedent formula to which the step 
applies. 

For the schemata of negation and universality introduction the statement 
is obvious. 


* The choice of a member of a conjunction A & B ata step (82) can be transformed to the 
choice of a number, by agreeing that the first or the second conjunction member will be 
chosen, according as the number is odd or even. 
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In the schema of induction the conclusion T, A > F(t), by the use of 
steps (a1), (#2) can be turned into a sequent [*, A* + F*(n) where n is a 
numeral. By applying (a), (#2) with the same replacements of the variables 
of T, A, F(1) to the premises, these will be turned into the sequents 
[* > F*(1) and F*(k), A* > F*(k+1), where k is an arbitrary numeral. 
(Possibly not yet all permitted applications of (#, ) are made here.) From the 
assumption that a reduction process exists for the sequents T > F(1) and 
F(a), A > F(a+1), it follows that we have also reduction processes for 
* > F*(1) and also for F*(k), A* > F*(k+1), where k is any numeral. 
(It is assumed that the variable a does not occur in I, A, F(1), F(t).) 

If now n is 1, then the sequent [*, A*  F(n) is either identical with 
I* — F(1) or obtainable from it by rule (a2). If n is the successor of m, 
then *, A* > F*(n) is obtainable from [* > F*(1) and the m sequents 
F*(k), A* + F*(k-+1), wherein k is successively 1, 2, ..., m by applications 
of cut, together with rule (a;). 

Thus the only thing still to be shown is that also cut has the property 
that if for both premises [ — A and A, A— B we have a reduction pro- 
cess, we can get from them a reduction process for the conclusion T’, A> B. _ 

To show this, we begin with a remark about free variables. A reduction of 
the sequent T, A > B must begin by replacing the free variables by numerals. 
By such replacements (arbitrarily chosen) I, A, B becomes I, A, B, and 
by making these replacements for those free variables in A which occur 
in the sequent , A > B, and adding replacements for the other free variables 
(if any) in A, the formula A becomes A. Since by assumption we have 
reduction processes for the sequents Tf — A and A, A > B, we also have such 
processes for the sequents IT >A and A, A+B resulting from the former 
by reduction steps («,). The proof will be complete if from these reduction 
processes we can obtain a process for the sequent i, A> 8. 

There is no loss of generality in assuming from the beginning that the 
sequents T > A and A, A > B (therefore also T, A > B) contain no free 
variables. Likewise there is no loss of generality in assuming that all terms 
in the sequents are computed so that no other terms occur than numerals. 

First consider the case in which the formula A contains no logical symbols 
and hence is a numerical equation. Notice two things: 

1. From a reduction process for a sequent S we can immediately obtain 
a reduction process for any sequent obtained from S by adding other 
formulas to the antecedent. 

2. From a reduction process for a sequent S which has a true numerical 
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equation Q in the antecedent, we can immediately get a reduction process 
for the sequent obtained from S by deleting the formula Q from the antece- 
dent. 

If now in the sequent A, A > B for which we have a reduction process, 
A is a true numerical equation, it follows from the second remark that we can 
also obtain a process for the sequent A— B, and, by the first remark, also for 
the sequent T’, A > B. If A is a false numerical equation, then by applying 
reduction steps (B,), (Bz), (B3) to the sequent I, A > B we come either 
to a final form; or to a sequent T, A > Q, where Q is a false numerical 
equation; or to a sequent C, I, A > 1 = 2. By hypothesis we have a reduc- 
tion process for T > A; and by the first remark above also for, A >A 
and C,l,A— A. But since Q and 1 = 2, like A, are false numerical 
equations, the process for T, A > A is also a process for T, A > Q, and the 
process for C, T, A > A is also a process for C, T, A> 1 = 2. 

Hence, if A is a numerical equation, we can get from reduction processes 
for the sequents [ > A, A, A—>B a reduction process for [, A —> B. 
In order to prove the same thing for a cut with an arbitrary A (supposing 
only that in the sequents of the cut all terms are numerals), we proceed by 
reducing successively the number of logical symbols in the formula A, 
which Gentzen calls the mix-formula (Mischformel) of the cut. So we have 
to show that we have a method for getting from reduction processes for the 
premises of a cut schema (with no other terms than numerals) a reduction 
process for the conclusion, provided that we know already how to do this 
for any cut schema whose mix-formula has fewer logical symbols than the 
one of the cut schema under consideration. This proviso we shall refer as 
to ‘the inductive hypothesis’. 

We begin by paralleling the reduction of the sequent A, A > B. As long 
as the reduction steps do not involve the antecedent formula A, each such 
step can likewise be applied to the sequent I’, A — B. If by these steps the 
sequent A, A —> B is reduced to a final form, the same holds for, A > B. 
Otherwise there is in the reduction of A, A > B a first step which involves 
the mix-formula A. This step must be of one of the kinds (y,), (v2), (Ys); 
and the sequent to which it applies has the form A, A* > Q, where Q is a 
false numerical equation. The mix-formula A has one of the forms 
(Vx)F(x), C&D, -C. 

In case A is a formula (Vx)F(x), the reduction step under consideration 
is of the kind (y,) and it yields either a sequent F(k), A* > Q, where k 
is a numeral, or a sequent A, F(k), A* > Q. If the numeral k in F(k) 
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occurs as a part of a term which contains a function symbol, we have to 
apply one or more steps (a), by which F(k) becomes a formula F,, all 
terms of which are numerals. Hence in the reduction of A, A ~ B we come 
either to F,, A* > Q (Case 1) or to A, F,, A* > Q (Case 2), whereas in 
the reduction of [, A > B we have at this stage arrived at [, A* > Q. 

In case 1 we are almost at our goal. For in the reduction of the sequent 
I — A the first steps can be chosen so as to turn it into T > F,. And the 
reduction process for T > A gives a process for T > F,. Likewise the 
process for A, A > B contains a process for F,, A* — Q. And since in the 
formula F, the number of logical symbols is smaller than in A, we can 
apply the inductive hypothesis according to which we know already how 
to obtain from the reduction process for TF > F, and F,, A* > Q a reduc- 
tion process for T, A* > Q which, combined with the reduction steps 
already used to get from I’, A > B to I, A* = Q, yields a reduction process 
for the sequent T, A > B. 

In case 2 we have a reduction process for the sequent A, F,, A* > Q, 
and the reduction of T, A > B has again arrived at the sequent I’, A¥ > Q, 
Let us now provisionally assume that from the reduction process for TF > A 
and A,F,,A*—>Q we have obtained a process for the sequent 
T, F,, A* > Q which results from the two of them by cut: then we have 
likewise a process for F,, P, A* > Q; and from this together with the process 
for [ > F, (already used in case 1) we obtain according to the inductive 
hypothesis a process for T, 1, A* > Q, and hence also for T, A* > Q. 
which again together with the performed reduction of T,A—-B to 
I, A* > Q yields a reduction process for T, A > B. So in case 2 our goal 
is attained upon the provisional assumption. 

Before discussing this assumption, let us consider the other two possible 
forms of the mix-formula A. If A is a conjunction C & D, almost nothing 
is changed, except the formula F, is replaced by one of the conjunction 
members. When A is a formula — C, the reduction step to be applied to 
A, A* > Q makes this sequence either into A* > C (case 1) or into 
A, A* > C (case 2). And in the reduction of the sequent T > A the first 
step makes it into C, FF > 1 = 2. 

Now in the first case, since C has fewer logical symbols than A, from the 
reduction process for the sequents A* — C and C,l > 1 = 2 we obtain 
by the inductive hypothesis a process for the sequent A*, Il > 1 = 2, 
and hence also a process for Fr, A* > Q, which, together with the reduction 
steps leading from I, A> B to T, A* > Q yields a process for T, A > B. 
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In case 2 the provisional assumption is that from the reduction processes 
for T > A and A, A* > C we get a process for the sequent T, A* > C 
(which results from the two of them by the cut schema). This process 
together with the one for C, I > 1 = 2, yields, according to the inductive 
hypothesis, a process for I, A*, I > 1 = 2, and hence for I, A* > Q, 
and so again for’, A > B. 

Now it remains to consider the provisional assumption which, for all 
forms of the mix-formula syas that from the reduction processes we have 
for [ +A and for a sequent A, A, ~ B,, we can get a process for 
IT, A, > B,. It might first seem that nothing has been gained by replacing, 
in the assertion to be proved, the sequent A, A > B by A, A, > B,. But 
the last sequent occurs in the reduction of A, A > B, and its reduction is 
the part of the reduction of A, A > B which follows the steps leading from 
A, A > B to A, A, > B,. Thus the replacement means progressing in the 
reduction of A, A > B, and therefore it can take place only a limited 
number of times. This argument concludes the consistency proof. 

Concerning this proof of Gentzen’s one might ask in what respect it 
transgresses the methods formalizable in the formal system under considera- 
tion, as must be the case by the Gédel incompleteness theorem. Gentzen 
himself gave the answer by stating that it is in the concept of a reduction 
process that the transgression comes about. Indeed this concept involves 
universal quantification over free choice sequences, and this quantification 
occurs not only in assertions, but also in hypotheses. 

The concept of a reduction process is also introduced in Gentzen’s 
Annalen paper, but there it is not properly used; it is rather replaced by the 
more elementary concept of a reduction step applied to a derivation 
(‘Reduktionsschritt an einer Herleitung’). Whereas in the concept of a 
reduction process the requirement of terminating after finitely many steps 
is involved, in the Annalen paper the ending of the procedure is proved, 
as you know, by assigning ordinals below &9 to the derivations, next showing 
that every reduction step on a detivation lowers its ordinal, and finally 
proving ordinal induction up to &. 
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BURTON DREBEN AND JOHN DENTON 


Introduction. In this paper we outline an approach for studying consistency 
stemming from ideas of Herbrand. This approach ‘finitistically’ exploits 
the oldest and most naive idea in proof theory: a set of axioms is consistent 
if it has a model. Hence it is to be contrasted with that approach initiated 
by Gentzen in 1938 [7] and continued by Schiitte [18, 19, 20] in which proofs, 
that is, formal derivations, are subject to various purely syntactic manipu- 
lations, and questions of interpretation play little role. Indeed, the Herbrand 
approach is perhaps best viewed as a reformulation of Hilbert’s evaluation 
method, a reformulation that frees that method from its customary (and 
in our opinion obfuscating) dependence on the e-calculus [2, 12, 13]. 

The basic result on which we depend is the fundamental theorem of Her- 
brand, the finitistic correlate of the Lowenheim-Skolem Theorem [10]. The 
fundamental theorem says, roughly, that a set of axioms is consistent (in 
any standard formulation of the predicate calculus) if and only if there exist 
arbitrarily large ‘finite approximations’ to models. (A precise statement of the 
theorem is given below.) There are several preliminary comments worth 
making. First, the proof of the fundamental theorem is constructive, in the 
sense that given a proof of a contradiction from the axioms, we can effecti- 
vely show a point beyond which the construction of the ‘finite approximations’ 
to models becomes impossible, and conversely. Second, because of the na- 
tural model theoretic interpretations of the systems studied, under this ap- 
proach we can often use our intuition of what a model of a given set of 
axioms ‘should be’ to suggest ways of showing the consistency of the axioms. 
Third, the methods used to prove consistency generally give as immediate 
corollaries properties such as the recursive satisfaction of AE-formulas and 
the Kreisel no-counterexample interpretation, properties for which an ad- 
ditional argument is necessary under the Gentzen-Schtitte approach. On 
the one hand, this means that in the case of theories which have been shown 
consistent but for which a no-counterexample interpretation is not known, 
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it is presumably more difficult to prove consistency by this approach than 
by the pure syntactic approach; on the other hand, if we do succeed in prov- 
ing consistency by this approach, we are likely to obtain considerable ad- 
ditional information. 

The main body of this paper is divided into three parts. In the first, we 
give a precise formulation of the fundamental theorem of Herbrand. In 
the second part, we sketch how the method can be used to give a consistency 
proof for elementary number theory, and how the recursive satisfaction of 
AE-formulas and the no-counterexample interpretation follow immediately 
from the proof. Finally, in the third section we list some additional results 
and open questions. 


1. Statement of the fundamental theorem. We follow to a large extent the 
notations and terminology of Shoenfield [21]. We have a first order language 
with symbols —, v, and 4, variables, and for each n, n-ary function and pre- 
dicate symbols. In particular, we suppose that for each n, the language 
contains infinitely many n-ary function symbols. We use x, y, Z, w as syn- 
tactical variables for formal variables; f, g, for function symbols; p, g, for 
predicate symbols; and e for constants (0-ary function symbols). Terms are 
built up from variables and constants using function symbols; we use a, b, 
c, d as syntactical variables for terms. An atomic formula is an expression 
of the form pa, ...a,, where p is n-ary; formulas are built up from the ato- 
mic formulas using —, v, and 3. Free and bound occurrences of a variable ina 
formula are defined in the usual way. We write b,[a] for the expression ob- 
tained from 6 by replacing all occurrences of x in b by a, and A,[a] for the 
expression obtained from A by replacing all free occurrences of x by a, 
provided that for each variable y occurring in a, no part of A of the form 
4yB contains an occurrence of x which is free in A. The abbreviations 
(A > B), (A & B), (A @ B), VxA have their usual meanings; we follow 
Shoenfield’s conventions for omitting and inserting parentheses. (Cf. [21], 
pp. 14-18.) 

We depart slightly from Shoenfield in our definition of the logical axioms. 
For us, a /ogical axiom is a formula of one of the forms — A v A or 
A,[a] > 4xA. (Thus, we take identity and equality axioms, where needed, 
as nonlogical axioms.) We adopt as rules of inference the expansion rule 
(infer Bv A from A), the contraction rule (infer A from A v A), the as- 
sociative rule (infer (Av B)v C from Av (BvC)), the cut rule (infer By C 
from Av B and — Av C), and the 3-introduction rule (if x is not free in B, 
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infer 4xA > B from A - B). The theorems of a theory are the formulas 
obtained starting from the logical and nonlogical axioms of the theory using 
the above five rules of inference; and a theorem is /ogical if it has a logical 
proof, that is, if it can be thus obtained without nonlogical axioms. A theory 
T is consistent if not every formula in (the language of) T is a theorem.(Cf. 
[21], pp. 4, 20-22.) 

We now introduce some notions not occurring in Shoenfield. An oc- 
currence of an (existential) quantifier in a formula is restricted if and only 
if it lies within the scopes of an even (possibly zero) number of negation sym- 
bols; an occurrence that is not restricted is general. (In languages taking 
the universal quantifier as a basic symbol, occurrences of universal quan- 
tifiers are restricted or general according as they lie within the scopes of an 
odd or even number of negation symbols; for formulas in prenex form, the 
existential quantifiers are restricted and the universal quantifiers are general.) 
We obtain a validity functional form of a formula by the following steps: 

(i). replace the free variables of the formula by new and distinct constants, 
and rewrite the bound variables so that no two occurrences of quantifiers 
contain the same variable; 

(ii). delete each general quantifier of the resulting formula, and replace 
the occurrences of the variable quantified by it with occurrences of the term 
fx, ..-%X,, where x,,..., x, are the variables quantified by restricted quan- 
tifiers within whose scopes the general quantifier lies, and fis an n-ary func- 
tion symbol not previously used. We obtain a satisfiability functional form 
similarly, except that the restricted quantifiers are deleted instead of the ge- 
neral quantifiers, and the argument places of the new function symbols are 
filled by the variables quantified by geneial quantifiers within whose scopes 
the corresponding restricted quantifier lies. Thus, if 4 is a quantifier-free 
formula with free variables x, y, z, w, then a validity functional form of 
IxVyizA would have the form 4x3zA,, ,[e,, f,x], while a satisfiability 
functional form of 3xVyizA would have the form VyA,, . -[€;, €2, foy]. The 
new function symbols introduced into a validity or satisfiability functional 
form are called indicial function symbols. 

We also need the notion of the expansion of a formula over a domain. 
A domain is a finite nonempty set of variable-free terms; and the height of a 
domain is the maximum height represented in it, where the height of a con- 
stant is 0 and the height of a term fa, ... a, is 1+the maximum of the heights 
of the terms a,,...,a,. The expansion of a formula A over a domain D, 
written &(A, D), is defined by induction as follows: 
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(1). if A is an atomic formula, then &(A, D) is A; 

(2). if A is — B, then &(A, D) is — @(B, D); 

(3). if A is BV C, then 6(A, D) is &(B, D)v &(C, D); 

(4). if A is xB, and &(B, D) is C, then &(A, D)is C,fa,]v C,[a,]v ...v 

C,[a,], where a,,...,a@, are all the elements of D in some order. 

It is easy to see that for any formula A and any domain D, &(A, D) isa 
quantifier-free formula whose free variables are exactly the free variables of A. 
Moreover, if all of the quantifiers of A are restricted (in particular, if A is a 
validity functional form of some formula), then &(A, D) is truth-function- 
ally equivalent to the disjunction of the formulas obtained by deleting the 
quantifiers of A and replacing variables quantified by them with members of 
D in all possible ways. (The dual of the above statement also holds, in 
which ‘restricted’, ‘validity’, ‘disjunction’, are replaced by ‘general’, ‘satis- 
fiability’, ‘conjunction’ respectively.) We will refer to an expansion of a 
validity functional form of a formula A as a validity expansion of A, and 
an expansion of a satisfiability functional form of A as a satisfiability expan- 
sion of A. It should be noted that none of the above definitions or assertions 
are limited to formulas in prenex form. (The formulas Ay and Ag defined by 
Shoenfield on pages 53 and 56 are validity and satisfiability functional forms 
respectively of closed prenex formulas A.) 

We can now state the fundamental theorem of Herbrand [10]: 


(a). There is a uniform way to find (primitive recursively) a tautologous 
validity expansion for any logical theorem A from any logical proof of A. 

(b). There is a uniform way to find (primitive recursively) a logical proof 
for a formula A from any tautologous validity expansion of A. 


As an immediate corollary we have the Herbrand consistency theorem: 


A theory all of whose nonlogical axioms are closed (i.e., contain no free 
variables) is consistent if and only if every satisfiability expansion of each 
(finite) conjunction of its nonlogical axioms is truth-functionally satisfiable. 


For expository convenience, both the fundamental and the consistency 
theorem have been stated in terms of Shoenfield’s formulation of the pre- 
dicate calculus. But, of course, strictly analogous theorems can be stated for 
every (standard) formulation of the predicate calculus. And with respect to 
several of these formulations, the fundamental theorem can be thought of as 
a more general version (more general because the constituent formulas in 
the end sequent need not be prenex) of Gentzen’s verschdrfter Hauptsatz [6]. 
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Interestingly enough, though, part of the contrast between what we have 
called the Herbrand and the Gentzen approaches to proof theory can already 
be seen in the quite different kinds of argument that Herbrand and Gentzen 
give for this Herbrand—Gentzen theorem. Herbrand’s argument does not 
turn on syntactic manipulations, and hence is readily applicable to practical- 
ly all formulations of the predicate calculus. Herbrand’s primary concern 
is to establish part (a) of the fundamental theorem. Thus, in terms of Shoen- 
field’s system, the core of the argument is the construction of an analyzing 
function for the cut rule, that is, a five-place primitive recursive function 
o(i, j, k, p, ¢) with the property that, for all i,j, k, p,q = 1, if formulas 
A, B, C, contain i, j, and k occurrences of quantifiers respectively, and if the 
formulas Av B and — AVC have tautologous validity expansions over 
domains of heights p and gq respectively, then the formula Bv C has a tau- 
tologous validity expansion over a domain of height o(i, j, k, p, 7). The rest 
of the argument for part (a) is simple. Indeed, for each of Shoenfield’s other 
four rules of inference the identity function can serve as an analyzing func- 
tion, that is, if A is any formula having a tautologous validity expansion over 
a domain of height p and the formula B comes from A by one application of 
the expansion, contraction, associative, or d-introduction rule, then B also 
has a tautologous validity expansion over a domain of height p. Finally, for 
each j > 1 and each term a of height A > 0, if a formula A contains j oc- 
currences of quantifiers, then the formula A v — A has a tautologous validity 
expansion over a domain of height j, and the formula A,[a] — 4xA has a 
tautologous validity expansion over a domain of height j+h. 

A detailed proof of the fundamental theorem as well as a discussion of its 
relation to the verscharfter Hauptsatz and the Hilbert-Bernays e-theorems 
will appear in our forthcoming monograph with Scanlon [4]. (The proofs 
Shoenfield gives in pages 41-55 of what he calls the consistency theorem and 
Herbrand’s theorem are adapted from the proofs in Hilbert-Bernays [13] of 
the ¢-theorems. It is important to realize, as Hilbert and Bernays do, that 
these proofs are of a piece with the Gentzen approach but not with the Hil- 
bert evaluation approach.) 

To see the way in which the fundamental theorem (more precisely, its 
corollary the consistency theorem) reduces the problem of consistency to 
the construction of ‘finite approximations’ to models, let us consider the 
case of a theory whose nonlogical axioms A; are closed and of the form 
Vx,...Vx,4yA;, where A; is quantifier-free. The satisfiability functional 
forms A‘ of such formulas are of the form Vx, .. Ee Len ee, |e 
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and a satisfiability expansion of a conjunction of such formulas will be a 
conjunction of formulas of the form 


A peel, poe OOD) An» fiay CO 0 Dont (1) 


The fundamental theorem tells us that the theory will be consistent if and 
only if every finite conjunction of the formulas (1) is satisfiable. 

From a nonconstructive point of view, of course, the assertion that every 
finite conjunction of formulas (1) is truth-functionally satisfiable is equiva- 
lent to the assertion that the theory has a model, i.e., that there is a structure 
in which all formulas (1) (for every i and any terms a,,..., a, of the theory) 
and hence all the axioms of the theory are satisfied. (The connection is made 
via the ‘infinity lemma’ or the ‘law of infinite conjunction’.) But there are 
theories (e.g. elementary number theory) for which one cannot provide a 
model in any constructive way (i.e., a way which would enable us to deter- 
mine effectively the truth values of formulas of the theory in that model), 
but for which one can provide a general method which enables one, given 
any finite conjunction of formulas (1), to construct a truth assignment under 
which these formulas are all true. The various truth assignments provided 
by this general method give us information about the theory (and its theo- 
rems) which is not provided by a model presented in a nonconstructive way. 

As an example of such additional information, suppose that a closed for- 
mula of the form VziwB, B quantifier-free, is a theorem of a theory whose 
nonlogical axioms A; are as above. Then, by the deduction theorem, a for- 
mula of the form 


(A, & A, &... & A,) > VzdwB (2) 


is a theorem of a theory with no nonlogical axioms. Now a validity function- 
al form of (2) will be of the form 


(AP &... & A®) > IwB fe], 


where 4! is a satisfiability functional form of A;. (Note that an occurrence 
of a quantifier is restricted in A; if and only if the correspondingly placed 
occurrence of the quantifier is general in (2)). A validity expansion of (2) 
will then have the form 


(8(A®, D) &... & &(A®, D)) + &(4wB[e], D). (3) 


It can easily be shown that if a formula has a validity expansion which is 
a tautology, then it has such an expansion over a domain whose terms are 
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built up from the constants and function symbols occurring in its validity 
functional form (provided that at least one constant occurs in the functional 
form). Thus, in the present case, we may suppose that the domain D con- 
sists of terms built up from the constant e using contants and function sym- 
bols occurring in the axioms A,, together with the indicial function symbols 
f;. Uf we have a model . for the functional forms A$, and we assign e any 
value in the universe of ., then we can compute relative to the model the 
values of the terms in D. Since the expansions &(4$, D) all hold in», and 
(3) is a tautology, it must be that for some ain D, B, ,,[e, a] holds in #. 
Thus, for each element e,, of the universe of .o%, we can effectively find an 
element a, such that B, ,,[e, a] holds in 7, where ey and a, are the elements 
denoted by e and a respectively. In the case of number theory, this gives us 
essentially the recursive satisfaction of AE-formulas. 


2. The consistency of elementary number theory. As one might expect, the 
Herbrand approach to problems of consistency is most easily applicable 
to those theories for which we have a fairly clear idea of what a model should 
be. An extremely simple example of this is afforded by Shoenfield’s theory V 
on pages 22 and 51 of [21], or by any other formalization of (elementary) 
number theory without induction in which no indicial function symbols oc- 
cur in the functional forms of the axioms. Here the elements of the model 
will be the natural numbers. Hence, to specify a suitable structure ~ (see 
[21], p. 18), we need only assign in the obvious way numbers, computable 
functions, and decidable relations to whatever functions and relation 
symbols occur in the axioms. For example, to 0, S, +, °, =, and <, the 
primitive symbols of N, are assigned respectively the number 0,,, the suc- 
cessor function S,, the sum +,,, the product -,,, the identity relation 
=, and the less than relation <,,. Clearly, 7 is an (effective) model for 
the axioms, and in particular for the satisfiability expansion of any conjunc- 
tion of closures of the non-logical axioms over any domain D that consists 
just of terms built up from the constants and function symbols occurring 
in the functional forms of the axioms. (It follows from the remark in the last 
paragraph of section | that only such domains need be considered; henceforth 
we shall tacitly restrict our attention to them.) By the consistency theorem, 
elementary number theory without induction is consistent. 

Now let us extend N, or any analogous theory, by adding induction. First, 
consider an induction axiom of the form 


¥2((A,[0] & ¥x(A > A,[Sx])) > VxA), (1) 
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where A is a quantifier-free formula whose free variables are z and x. A 
satisfiability functional form of the closure of this formula will be of the 
form 


V2((A,{0] & (4,0 fz] > A,[Sf2])) > Vx), (2) 


where / is the indicial function symbol corresponding to the restricted quan- 
tifier Vx appearing in the antecedent of the conditional. Intuitively, we want 
f to stand for a number theoretic function with the property that if for some 
number k, VxA,[k] is false and A, ,[k, 0] is true, then A, ,[k, fk] is true and 
A, ,{k, Sfk] is false (for example, by letting fk be the least number n such 
that A, ,[k, Sn] is false). But there is no way in general of deciding effectively 
whether VxA,[k] is (number theoretically) false. So we cannot proceed as 
we did in the case of the system without induction. It is at this point that the 
full power of the fundamental theorem comes into play. Namely, to show 
the consistency of the induction axiom (1) with the other axioms of number 
theory, it suffices to show that for each domain D there is a structure .7(D) 
in which the satisfiability expansion &((2), D) (and the expansions of the 
other axioms) is true. In this structure ./(D), we do not have to insure that 
Ajfk, fk] is true and A[k, Sfk] false unless, for some term a belonging to 
D, Afk, a] is false in »/(D) (and A[k, O]is true). That is, if the least number 
n such that A[k, Sn] is false is ‘too large’ in relation to D, then we do not 
have to take it into consideration. (Here and frequently below, we simplify 
the substitution notation by omitting subscripts.) 

Now for each D, an (effective) structure ./(D) that satisfies the expan- 
sions over D ofall the other axioms is obtainable from the structure .- merely 
by assigning any computable function « to the function symbol f. Of course, 
distinct @’s will normally be picked for distinct D’s. Our whole task is to 
find for a given D an « such that (D) satisfies &((2), D) as well. More 
precisely, for each domain D and function «, let m be a mapping of the terms 
of D into the numbers such that 


TA) = De: m(a: b) = m(a) + m(b) 
m(Sa) = S,,m(a) m(fa) = a(m(a)) (3) 
m(a+b) = m(a) + 4 m(b) 
Each such mapping will be said to induce a structure (D). To induce a 


model for &((2), D), we proceed by successive approximation. 
We begin by trying the function a such that %9(k) = 0. for all numbers k. 
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Assuming that «; has been defined for some i, we define m; by conditions 
(3), using «; in place of «. We then let .o; be the structure induced by m,. 
If the expansion &((2), D) is true in .7;, then clearly we are done; if &((2), 
D) is false in .°7;, then it is because for some term a belonging to D, A[a, 0] 
and A[a, fa] + A[a, Sfa] are true in .7;, but for some b in D, Alfa, b] is 
false in o7;. Now we have m,(b) # 0, since if m,(b) = 0,4, then A [a, 0] 
and A[a, b] would have the same truth-value in .7,;. Thus, our difficulty 
comes from the fact that we have not chosen m,(fa) = «,(m(a)) in such 
a way that A[a, fa] is true in .°7; but A[a, Sfa] is false in .7;. However, the 
remedy is clear: Since A[m,(a), 0,4] is true (over the numbers) but A[7,(a), 
m,(b)] is false, we can find a number n < m,(b) such that A[m,(a), n] is 
true but A[m,(a), S,,n] is false. Hence, we define the function «;,, as fol- 
lows: «;4,(k)=«;(k) for all numbers k except k = m,(a), where a is as above, 
in which case we define «;,,(k) =n, where nis as above. We define m;,, by 
conditions (3), replacing « by a;,,, and we let ;,, be the induced struc- 
ture. 

Clearly, «;,, is a better approximation to the desired function than is 
%,; our problem now is to show that by repeating the above procedure suf- 
ficiently many times, we will eventually find a structure in which &((2), D) is 
true. The first observation to be made is that for alli, non-zero values of «; are 
‘correct’, more precisely, if «;(k) # 0, then A[k, «,(k)] is true and A[k, 
S.¢%;(k)] is false. This implies that we can measure progress towards our goal 
by counting the number of terms a belonging to D such that m,(fa) = 0,; 
the fewer of these there are, roughly speaking, the closer we are to our goal. 
In calculating this measure, however, we have to give priority to terms of low 
height. The reason for this is that in going from «; to «;,, we may have chang- 
ed the value of m,(fa), where the height of a is relatively low, and in so doing 
we may change the value of m,(fb), when a occurs in 4, in unpredictable 
ways. An appropriate way to measure progress is the following: We let 
h(i, p) be the number of terms a in D of height p such that m,(fa) = 0,7, 
and we let index (i) be the py + I-tuple (H(i, 0), . . ., A(i, po)), where po is the 
height of D; we write index (i,) < index(i,) if and only if, for some p (0 < 
(Peep, ewe have iO) = 1G>,0) aan aieypl))— AG, p=1), and 
h(i,, p) < h(i, p). It is easy to show that if &((2), D) is false in ,, then 
index(i+1) < index(i). Since the relation < is a linear ordering, and since 
the numbers /(i, p) are bounded by qg, where q is the number of terms in D, 
it follows that for some i < (1+q)?°, the satisfiability expansion &((2), D) 
must be true in .%;. The expansions of the other axioms of elementary num- 
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ber theory are true in every ~,;, as we have seen, so we have that for every 
domain D there exists a structure ./(D) in which the satisfiability expan- 
sions of all of the nonlogical axioms of our theory are true. Hence, by the 
fundamental theorem the theory is consistent. (This theory can be shown 
consistent by simpler arguments, and Herbrand and Gentzen have both done 
so; see [9], [6], and also [15]. But such arguments cannot be extended to 
full number theory. Moreover, the argument just sketched for constructing 
W ,is essentially the one used for constructing an analyzing function for the 
cut rule; see [3], [22], and [4].) 

In the proof of the consistency of systems such as N, we were able to use 
the same structure as a model of the satisfiability expansion of all the axioms 
over a domain D, independent of D. When we added induction axioms for 
quantifier-free formulas we could no longer do this, but we were still able 
to construct our successive approximations to models without back-tracking, 
in the sense that once we had found a ‘correct’ value for «;(), we never nee- 
ded to change that value at a later stage in our construction. However, in 
order to prove the consistency of elementary number theory with an un- 
restricted axiom schema for induction, we must abandon this characteristic 
of our construction in order to preserve the principle that ‘non-zero values 
are correct’. Necessarily, this means that we must use more delicate means to 
measure progress towards our goal. It is in establishing that these measures 
do in fact show that we reach our goal after a finite number of iterations that 
we use methods not formalizable in the theory being shown consistent, and 
thus escape the limitations imposed by Gédel’s theorem on consistency proofs, 
Nevertheless, the basic strategy of our construction remains unchanged: we 
map terms occurring in satisfiability expansions into elements of an intuitive 
model so as induce an appropriate structure. 

To illustrate how this is done, consider the case of an induction axiom (1), 
in which we now suppose A to be of the form 4yB, where B is quantifier-free 
and has free variables z, x, y. Then (1) becomes 


Vz((SyB[0, y]&Vx(AyB[x, y] > AyB[Sx, y])) > Vx4yB[x, y]), (4) 


where B[a, b] denotes B, ,[a, b]. Now if we were carrying out the details 
we would see that it is convenient to adjoin a new axiom 


VzVx(4yBLx, y] > BLx, gzx]), (5) 


where g is a new binary function symbol. Clearly the consistency of the sy- 
stem obtained by adjoining this axiom implies the consistency of the ori- 
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ginal system; on the other hand, formula (5) is a satisfiability functional 
form of VzVx(4yB > dyB), and hence its adjunction cannot destroy con- 
sistency. (This has nothing to do with the axiom of choice.) In the presence 
of (5), (4) is provably equivalent to 


Vz((B[0, gz0] & Vx(B[x, gzx] > B[Sx, gzSx])) > VxB[x, gzx]), (6) 
which in turn has as a satisfiability function form 


Vz((BL0, gz0] & (BLfz, gzfz] > B[Sfz, gzSfz])) > VxB[x, gzx]). (7) 


Thus, our problem is to show that any satisfiability expansion of the con- 
junction of (5) and (7) (and the other axioms of number theory) is satisfiable. 
As before, our method is to find a mapping m sending the terms occurring 
in the expansion into numbers in such a way that the expansions are true 
in the induced structure. This mapping is to satisfy the conditions (3), and 
the additional condition m(gab) = B(m(a), m(b)), where B is a binary func- 
tion to be determined by successive approximation. We start by taking 
ao(k) = Bo(k, n) = 0,, for all numbers k and n. Given «; and £;, we define 
m, by conditions like those just stated for m, replacing « and f with a; 
and f;. 

For a given domain D, if the induced structure &, is such that the ex- 
pansions &((5), D) and &((7), D) are true, then of course we have reached 
our goal. Otherwise, there are two alternatives. Either there exist terms a, b, 
cin D making B[a, b, c] true in 7; and Bla, b, gab] false in ;, or there 
exist terms a, b in D such that B[a, 0, ga0] and B[a, fa, gafa] — B [a, Sfa, 
gaSfa] are true in &; but Bla, b, gab] is false in ;. (Here B[a, b, c] de- 
motcsmaye la, Dyc].) 

In the first case, the difficulty is clearly that m,(gab) = B,(m;,(a), m,(b)) 
has an ‘incorrect’ value which we can change to a ‘correct’ value m,(c). But in 
order to preserve the principle that non-zero values are correct for «;, we must 
make sure not to pass on a non-zero value of «; which has become ‘incorrect’ 
by virtue of this change in B;. We take care of this by a seemingly brutal 
procedure: If we go from f; to B;,, by correcting an ‘incorrect’ value, then 
we define «;,,(”) to be zero for all numbers 7, thus, so to speak, erasing all 
non-zero values of «;. Although this might appear to be a step in the wrong 
direction, it will turn out that if we think in terms of sequences of maps, we 
can still devise a measure of progress with the right properties. We turn now 
to the second of the alternatives. 

If the satisfiability expansion &((7), D) is false, then it is because m,(fa) 
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= a,(m,(a)) has an ‘incorrect’ value. Hence we define «;4,(k) = «;(k), 
except when k = m,(a); for this k, we take «;,,(K) to be the least number n 
such that B[m,(a), n, B,(m;(a), n)] is true and B[m,(a), S.n, B;(m,(a), S,n)] 
is false; such a number exists and is smaller than m,(b). We also set B;+,(k, 1) 
= B,(k, n) for all k, n. We have not violated the principle ‘non-zero values 
are correct’ in so doing, since the function symbol f does not occur in 
B[z, x, y] > Blz, x, gzx], the formula serving as the criterion for the ‘cor- 
rectness’ of values of f;. 

The method by which we devise measures of progress in order to conclude 
that the above procedure eventually gives us a model for the expansions 
&((5), D) and &((7), D) is essentially that of Ackermann’s 1940 consisten- 
cy proof [1] (see also Wang [23], pp. 362-375) adapted to Herbrand’s 
language. The formulas (5) and (7) correspond to critical formulas of the 
first and second kinds respectively. The use in our approach of function sym- 
bols instead of e-terms makes it easier to see what is going on. Briefly, what 
happens is that when we ‘erase’ all non-zero values of «;, we eventually re- 
place them with at least as many new ‘correct’ values (correct according to 
the new definition of B;), and in this way we eventually definea mapping m; 
inducing a structure in which &((5), D) and &((7), D) are true. Ordinal 
numbers are assigned to the pairs of functions («;, B;), finite sequences of 
such pairs, finite sequences of finite sequences of such pairs, etc, (Note that 
these functions, sequences, etc., have values different from zero for only 
finitely many sets of arguments, and hence can be coded into numbers.) 
The construction sketched above leads to decreasing sequences of ordinals 
until a model for the expansion is reached; since a decieasing sequence of 
ordinals must terminate after finitely many terms, we have our desired con- 
clusion. The ordinals employed are constructive, and the bounds on the 
size of the ordinals involved are essentially the same as those found in the 
Gentzen-style proofs. Although the bounds increase as a function of the 
number of quantifiers occurring in the formulas appearing in the induction 
axioms, the conceptual difficulties do not increase; only the bookkeeping 
gets more complicated. 

As we mentioned in the introduction, the recursive satisfaction of AE- 
formulas and the no-counterexample interpretation result quickly from our 
consistency proof. The remarks at the end of section 1 should suffice to 
indicate how recursive satisfaction is obtained; we turn now to the Kreisel 
no-counterexample interpretation. Let us consider a theorem of elementary 
number theory of the form Vx4yVzB[x, y, z], where B[x, y, z] is quantifier- 
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free and has only the indicated free variables. By the deduction theorem, we 
have a logical theorem of the form 


A, &...&A, > VxayVzB[x, y, z], 


where the formulas A; are closures of axioms of number theory. A validity 
functional form of this formula will be of the form 


A® & ... & AS — JyBle, y, fy], (8) 


and by the fundamental theorem we can find a domain D such that 
&((8), D) is a tautology. Thus, if we have a structure in which each of the 
&(A®, D) is true, then for some a in D, Ble, a, fa] must also be true, since 
&((AyBle, y, fy]), D) is a disjunction. Now suppose someone comes to us 
and says: ‘I have a number n and a function @ such that for all numbers k, 
B[n, k, o(k)] is false.” Using the procedure described above, we can find a 
mapping m which maps the terms occurring in &((8), D) into numbers, maps 
e ton, maps fa to y(m(a)) for all terms a, and is such that the mapping m 
induces a structure in which &(A$*, D) is true for i= 1,...,r. In this 
structure, B(e, a, fa] is true for some ain D, and hence B[n, m(a), p(m(a))] 
must be true. Thus we have been able to defeat the proposed counterexample 
to the truth of Vx4yVz B[x, y, z]. It is easy to make this argument completely 
general; it will apply to any consistency proof in which we map the terms 
occurring in a satisfiability expansion of the axioms so as to induce a struc- 
ture in which the expansion Is true. 


3. Further results and prospects. The fundamental theorem in conjunction 
with mapping techniques which induce suitable structures has proved useful 
for investigating solvable cases of the decision problem (e.g., see [5]), as 
well as for clarifying Kreisel’s primitive recursive solution to Hilbert’s 
seventeenth problem [14]. 

In addition, Scanlon has extended the argument sketched in section 2 to 
provide a proof of the w-consistency of elementary number theory [4]. 
This proof is of the no-counterexample kind discussed by Kreisel in [16]. It 
is shown that given a proof of a formula 4xB and a number theoretic func- 
tion w, we can find a natural number » such that if D is a domain of height 
W(n), then the validity expansion over D of the formula A; &... & A, > 
—B,,[S‘”0], where the axioms A; have at most (n) quantifiers, is not a tautol- 
ogy. This is the sense in which w-consistency must be taken in order for the 
notion to have constructive content, as Kreisel has remarked. (There is an 
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interesting contrast between w-consistency and simple consistency. Kreisel 
shows in [16] that the w-consistency of number theory without induction 
immediately implies the simple consistency of full number theory. But Par- 
sons has pointed out that the (arithmetical correlate of the) converse is not 
provable in full number theory). 

Candidates for further extensions of the Herbrand approach are the various 
subsystems of analysis and type theory which have already been shown con- 
sistent by other methods. An obvious candidate is the so-called ramified 
analysis in which there are different levels of sets, and the comprehension 
axioms for the sets of each level contain only quantifiers for sets of lower 
levels. It seems clear that a consistency proof along the lines of section 2 will 
involve great combinatorial complexities, for any such proof will afford much 
insight into ramified analysis. But the finding of such a proof should perhaps 
be taken as a touchstone for the likelihood of further useful results from the 
Herbrand approach to consistency, especially since Parsons establishes in 
[17] the w-consistency of ramified analysis, and Kreisel states that Gddel’s 
consistency proof for number theory [8,21] can be extended to give a no- 
counterexample interpretation for ramified analysis. 
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ITERATED INDUCTIVE DEFINITIONS AND 32-AC* 


HARVEY FRIEDMAN 


In this paper, we discuss the relations between 2}, ,-AC and [1}/-CA, k > 0. 
After that, we discuss 2,-DC and iterated generalized inductive definitions. 
1I,-CA, for k > 0, is just the comprehension axiom (schema) applied to 
Hi} formulae, together with EA. A TH, formula is one which has a block 
of & function quantifiers, starting with a universal one, followed by only 
number quantifiers and propositional combinations of atomic formulae.! 
EA is composed of first-order arithmetic together with the schema of ordi- 
nary induction on all second-order formulae. A Ij formula is a I$ formula, 
for the purposes of this paper. Z,-AC is the schema (Wn)(4/)P(n, f) > 
(Gg)(Vn)P(#, 9c), Where gq)(m) = g(2"** -3"*1), and P is 7,1, together 
with EA, if k > 1. If k = 1, then take P to be arithmetical. 

It is well known that ,,-AC is much stronger than J7,-CA, in the sense 
that Xj,,-AC can prove the existence of an w-model of I{}-CA. This ac- 
tually follows from our theorem. An w-model is one in which the integers 
are standard. Instead, if we consider a natural process of iteration of Hj-CA 
described below, and we do this process &9 times, we get a theory which we 
call (II;-CA)*°, and we have the 


THEOREM. 2,,,-AC is a conservative extension of (I,-CA)*® for II} sen- 
tences ifk >1. If k = 0, then it is a conservative extension for II}, sentences. 
For k = 1, we have II} sentences. 


(A theory T is said to be a conservative extension of S with respect to 7) 
sentences if and only if every 7 sentence provable in S is provable in T and 
T contains S.) 

The case where k=O was treated in the author’s doctoral dissertation [3]. 


* This research was partially supported by NSF G.P. 8764. 
t We use this particular definition of /7} so that we do not need extra axioms to prove 
the existence of a complete /7} predicate; one only needs /7}-CA. 
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The other cases of the theorem, as one can see from this paper, are proved 
by adaptation of the argument for the case k = 0. 

Now we define (f1;-CA)<®. There is a standard complete I; predicate 
P,(n,f, g) of the free variables shown, for each k > 0. We define, for a re- 
cursive linear ordering indexed by e (the field may not be all of ), the pre- 
dicate H7(f, e), k fixed > 0, as 

(Vp)(if p is the least point in the ordering e, thenf,,) = g) & 

(Vp)(if p is not in the field of e then f(,) is everywhere 0)& 

(Vp)(Vq)(if q is the successor of p in the ordering e then 

(Va)(Fa(n) = 0 Prlt, fio) 9) & f(t) = 1 ~ Pa fy 9) & 

(Vp)(if p is a limit then 

(Va)(fip(n) = 0 [n = <k, g> & @ is below pine & f(k) = 0) & 
int) = 1 ~ fy(”) = 0))). 
Thus, when k = 0, and e is a well-ordering, we are describing the hyper- 
arithmetic hierarchy (relative to g). 

Now consider the standard recursive well-ordering of type ¢). The field is 
w. For each n, €)(n) is defined as the natural index of the initial segment of 
the standard ordering, ¢), up to (but not including) 7. &9(”) is a primitive 
recursive function. 

Finally, (7;-CA)*® is I1j-CA together with the schema (Vg)(A/)(H&(S, 
&9(n))), Where n varies through . 

This theorem has special significance in the cases k = Oand 1. Fork = 0, 
Zi-AC is, on the face of it, impredicative, further corroborated by the theo- 
rem of Kreisel that the minimum w-model of ¥}-AC is the collection of all 
hyperarithmetic functions (sets). But this theorem shows that the I 5 state- 
ments provable in 2;-AC can be predicatively proved. Furthermore, the 
proof of this metatheorem can be easily made strictly finitary. 

In the case of k = 1, there is a completely straightforward direct trans- 
lation (which we refer to later) of (IT,-CA)** into (and onto) the theory of 
iterated inductive definitions, where the iteration is carried through & times. 
This system ID** will be described later. Our theorem for k = 1 gives (a 
finitary proof) that 23-AC and ID<" have the same JT; theorems. This will 
be made more precise later. Of course, finitary relative consistency proofs 
are immediately obvious from the finitary proofs of conservative extension 
results. 

Rather than first give the most complicated argument that produces the 
most refined results (namely the whole of the theorem), we successively give 
sketches of more and more sharpened versions, 
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We define (17,-CA)® as (Vn)(Vg)(SS)(AR(S, e9(n))) + 15-CA. Note that 
this addition to 7j-CA is not a schema. 
We first prove 


THEOREM 1. Any Z} sentence provable in Z;—AC is provable in T =(II1j-CA)° 
+ all instances of transfinite induction on our standard ordering, &, for 
formulae of complexity < 29. 

Our proof uses a basic lemma from proof theory, due to G. Kreisel [4] 
that follows from Gentzen’s work. 


LemMA |. Let A be any consistent axiom system in the language of 2nd order 
arithmetic that contains EA and such that the set of all axioms in A minus the 
axioms in EA form a set of sentences of bounded complexity. Then there 
is an instance of transfinite induction on our natural ordering of & which is 
not provable in A. In fact, the predicate which the unprovable (in A) trans- 
finite induction is applied to can be taken to have only one free variable, the 
number variable on which the transfinite induction is being carried out. More- 
over, this is best possible, in the sense that for each predicate Fn, (with possibly 
more free variables) and each m, we have that Tl(é9(m), Fn) is provable in 
EA. 


Unless explicitly stated otherwise, the only facts we shall use about the 
ordering & are 1) that the ordering is given by some recursive index e, 
2) that, using e, lemma | holds. Hence, the theorem will hold for any ordering 
R, indexed by some eé satisfying these conditions, in place of ¢,. Any R sat- 
isfying these conditions must be a well-ordering of type < & . 

Proof of theorem 1. We will suppose that we have a IT} sentence P, which 
is consistent with T. We write T+ P for T+ {P}. We have to show that 2;-AC 
+ Pis consistent. We do this by constructing a model of £}|-AC +P, assuming 
Con(T+P). 

We first note that since T+P is consistent, by lemma | we get a particular 
instance of transfinite induction on é9, TI(é); Fr), which is not provable 
in T+P. The induction is thought of as being performed on nv as an element 
of the domain of the ordering &). F will not have other free variables. Now 
consider T+P+ ~TI(€); Fn). We know it is consistent, and so it has a 
model, /. We are adding the negation of an instance of transfinite induction 
on é9. We are therefore considering a model of a false theory of analysis. 

The reader should note that this model M has, necessarily, non-standard 
integers. We do not make use of this fact. We, instead, form a submodel NV 
of M, in the sense that N will have the same ‘integers’ as M, but less 2nd 
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order objects (functions). It this sense, we are using an inner model construc- 
tion, and we are interpreting 2;-AC+P in T+ P+ ~TI(éo; Fn). 

Now let Gn be (Vk)(if k is below n in é) then Fk). We now come to the 
crucial step in the construction. We have ~TI(é), Gn) holding in M. We 
will now form a simple submodel of M which we shall see satisfies 5}~AC +P. 
We take the submodel N to have the same integers as M. The functions in 
N will be the collection of all functions g in M such that M satisfies 


(An)(Gn & (4h)(Ho(h, eo(n)) & g is recursive in /)). 


At this point, we advise the reader to draw a vertical line representing the 
ordinals < &) according to M. Then to note that the with Gn do not have 
a l.u.b. in M. The reader should draw a wavy horizontal line through where 
the l.u.b. ought to have been. P is satisfied in N since N is a submodel of M 
and P is satisfied in M@. N must satisfy EA since N is an inner model of 
M definable by a formula and EA is satisfied in M. To see that N satisfies 
TI5-CA, use that © satisfies that every point in the ordering ¢) has an 
(immediate) successor. This makes use of the transfinite induction on é 
that is part of T. 

We shall now show that N satisfies Y}-AC. Let (Vn)(4g)A(n, g, h), A 
arithmetical, A fixed in N, hold in N. A has no other free variables. 

Now consider the predicate 


Qm  (An){(Ax)(4g)(Ho(a, &o(7)) & g is recursive in « & A(n, g,h)) & 
(Va)(Vg)(Vr) ((Ho(a, éo(r)) & g is recursive in a & A(n, g, h)) 
—+r above m orr = min &))}. 


It is easily seen that M satisfies 


(a!s)(Vt)(((t above s or t=s in &)—> ~Qt)& 
(t below s in & — (4m) ((m above t or m = t) & Qm))), 


since M satisfies TI on é, for sufficiently complicated formulae, and the 
absoluteness of arithmetic properties. From the definition of N, it is clear 
that M satisfies 

(Vn)(n below the unique s > Gn), 


again by absoluteness. But then M satisfies Gs. Now choose ¢ as the first 
limit point in ¢9, 2 sin é,. We can do this in M, by the transfinite induction 
ON & available in T. Then M will satisfy Gt. There is a Z € N such that N 
satisfies H)(Z, é(t)). Then N satisfies 


(Vn)(Ag)(A(a, g, h) & g is recursive in the pair <Z, h)). 
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So, by IIg-CA, N satisfies (4Y)(Vn)A(n, (Y),, 4), and we are done. 
We now use the following refinement of lemma 1. 


LeMMA 2. Let A be any consistent axiom system in the language of 2nd or- 
der arithmetic, containing \st order arithmetic. Furthermore suppose that all 
axioms of A which are instances of ordinary induction, are instances of induc- 
tion applied to formulae of a certain bounded complexity, k. Also suppose that 
the axioms in A ininus the axioms in EA have bounded coinplexity p as in 
lemma 1. Then one can find an instance of transfinite induction on &)(G(k)) 
applied to a predicate of complexity F(p), which cannot be proved in A. G and 
F are primitive recursive functions on natural numbers. As in lemma 1, we can 
restrict the free variables to one in the transfinite induction. 


With this, we can obtain 


THEOREM 2. Any 2} sentence provable in Z{-AC is already provable in (IT}- 
Cale. 

An important point here is that the complexity of the induction is F(p), 
so it depends only on p, not k. The proof of theorem 2 is the same as of 
theorem 1, except that the two systems are broken up into subsystems: Let 
(2;-AC), be the same as 2;-AC, except that in EA, the induction is only 
applied to formulae of complexity k. Let T, be the theory (I75-CA)*™ + all 
instances of transfinite induction on &)(n), for formulae of complexity < 29, 
minus ordinary induction. Let p be the complexity of (J, 7, minus EA. 
(IT o-AC can be axiomatized so that there is this bound p). Then, in the same 
way as theorem 1 was proved from lemma 1, we can prove that every Z{ sen- 
tence provable in (2}-AC), is provable in Tox F(py + Ordinary induction ap- 
plied to formulae of complexity k x F(p), using lemma 2. Now using con- 
dition (2) on the ordering é,, we see that each T, is a subsystem of (I7>-CA)*®, 
and we have theorem 2. The addition of ordinary induction applied to formu- 
lae of complexity k x F(p) comes in when trying to show that the submodel 
satisfies ordinary induction applied to formulae of complexity k. 

At the crucial point in the proof of theorem 1 we took the submodel 
N of M as the functions g € M such that M satisfies 

(dn)(Gn & (4h)(Ho(h, &9(n)) & g recursive in /)). 
We can choose an «€ M and relativize this construction to the g € M such 
that M satisfies 

(dn)(Gn & (4h)(H¢(h, &9(7)) & g recursive in /)). 
In this way, we immediately obtain 
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THEOREM 3. Any IT} sentence provable in X}-AC is provable in (IIj-CA)*®. 
We now prove the analogue of theorem 1 for k = 2. 


THEOREM 4, Any Xj sentence provable in £3-AC is provable in (II{-CA)° + 
transfinite induction on &9 applied to all formulae of complexity < 29, Again 
call this theory T. 

Proor. Keeping the same notation as in the proof of theorem 1, we come 
to the assumption that N satisfies (V)(Ag)A(n, g, h), he N, where A is now 
II. The absoluteness assumption in the proof of theorem 1 was that for 
each g EN, A(n, g, h) holds in N if and only if it holds in M, and that for 
any fe N, M satisfies Ho(f; éo(7)) if and only if N satisfies it. In this case, 
we have this too. The first is guaranteed by the Kleene basis theorem, which 
must hold in M since it is provable in ;-AC. For Kleene’s theorem says 
that the functions recursive in the complete I} set in g satisfy the same 
II} formulae with parameter g as do all the functions. Hence, any 2} for- 
mula with parameters in N that holds in N will hold in M, because for any 
g € N, the complete I7j set in g, according to M, isin N. We can, of course, 
take formulae that have number quantifiers to the left of the function quan- 
tifiers 4, V, and this will still be true. Now, the predicate H,(/, éo(m)) is in- 
deed of this form. Now, for any ge N, we have that the complete I} 
set in g according to M isin N, and is also the complete I/} set according to 
N; also, if H,(f, 9, ()) holds in N for some particular fe N, then it holds 
in M, and vice versa. 

It is an easy matter, using this lifting of 2} formulae from N to M and split- 
ting into subsystems and lemma 2, to get 


THEOREM 5. Any IT} sentence provable in 3;-AC is provable in (II{-CA)**. 


We mention a result of R. Mansfield: 


THEOREM 6. 4,-CA, 23-AC, 23-DC (dependent choice) all have the same 
theorems. 


In the author’s doctoral dissertation, we have 


THEOREM 7, 4;-CA, Z}-AC, Z}-DC have the same IT} theorems, and, in- 
cidentally, the same theorems about hyperarithmetic functions. 


Lema 3. We can prove in II}—CA that every IT} formula with one free variable 
g which holds when the quantifiers are restricted to the functions recursive in the 
complete IT} set in g must hold when interpreted over all functions. In fact, 
we may weaken the hypothesis to insist only that we take the functions A} in g. 
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This follows from a result of R. Mansfield that the Kondo uniformization 
theorem is provable in A,-CA (and hence in JT}-CA). 
So we obtain 


THEOREM 8. Every IT sentence provable inZ3-AC is provable in (II,-CA)*®. 


We let fe L’ be an abbreviation for the natural formalization of ‘fis con- 
structible from g’ in 2nd order arithmetic. Then there is a predicate P(h, J, 9), 
provably 4} in I13-CA, such that we can prove (Vf)(feL’) > P(h, J, 9) 
defines a well-ordering on all funtions, as a relation on A and J,in IT5-CA. 
We obtain 


Lemma 4. We can prove in Iy-CA+(Ag)(VWf)(feL’), k > 2, that (Ag) 
(every 3; formula with free variable h which holds, holds when the quantifiers 
are restricted to functions A; in the pair <h, g)). 


From this, we obtain 
Lemma 5. Every II, sentence provable in Z;,,,-AC is provable in 


(Hi, -CA)-" + Gg)\WAG ee L*), for k > 2. 


In a paper in preparation, The 2nd order comprehension axioms, we will prove 
(among other things) 


Lemma 6. Every II, sentence provable in (IT,-CA)*°+(ag)(Vf)(fe L’) is 
provable in (IIj-CA)*”, k > 2. 


So we have 


THEOREM 9. Every ITj, sentence provable in 5; ,-AC is provable in (II,-CA)*®, 
ia? 


In the paper in preparation, we will also prove 
Lemma 7. Every IT}, sentence provable in 34-DC is provable in Aj-CA, k > 2. 
To finish the THEOREM, we need 


THEOREM 10. Each (II,;-CA)*® is a subsystem of 44 ,-AC. 

Proor. We argue in 2,.,-AC. We let Qn be (Vg)(Af)(AZ(SF, eo())). We 
wish to show that for each n, Qn is provable in 2,,,-AC. Fix an n. Choose 
an r above n in é9. Then r is, provably in 2;,,-AC, above nin é9, since & 
is recursively described. By lemma 1, transfinite induction on é)(r) for any 
formula is provable in 2, ,-AC. Hence we can prove, in 2,,,-AC that n 
has an (immediate) successor. Let this successor be m. By using lemma |, it 
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suffices to prove in 2,4, ,-AC that (Vq)(q below p in ¢) > Qq) -> Op, where 
p is below m. Arguing in 2}, ,-AC, if p is the least point in 9, then Op. If 
p has an (immediate) predecessor, g, then clearly Qq > Qp, using 17,-CA. If 
p is a limit, we assume (Vq)(q below p — Qq), and we define X(n, g) as 
n= <b, q> &q is below p in @ & G/)(AZY, e0(@)) & f(b) = 0) et 
Y(n, g) be n = <b, q> & q below p in & & (VA HKS, é0(9)) > f(b) = 0). 
Since p is below m, we can use transfinite induction on ¢9(m) in 244 ,-AC to 
prove, in 2,4,-AC, that X(n, 9) Y(n, g). We can also find predicates 
X"(n, g), ¥'(n, g), where X* is 3,4,, Y Wis T,,,, and X<+ ay ae 
provable in 23, ,-AC. Since Aj,,-CA < 244 ,-AC, we have, in 22,,-AC, 
that (4x)(Vn)(n € x <>» X(n, g)). Hence we have, in 2,,,-AC, that (@f)(Vn) 
((f(2) = 0 X(n, g)) & (f(n) = 1 ~ X(n, g))). Hence we have Qp. 

In Feferman’s contribution [1] on the theories ID’, we have a formulation 
of the theory ID°, where e is the recursive index of an ordering. 

If we take the union of the systems ID”, we obtain the system ID 

Now each P, in ID** has a standard interpretation as a function recursive 
in the unique f with H,(f, 9(7)), for some n. In this way, we can easily obtain 


<£0 


Lemma 8. Let A, be the usual positive predicate used in the standard inductive _ 
definition of O. Let n be such that P,,(n) is provable in 1D“. Then née O 
is provable in (ITj-CA)<®. 

By using the completeness of hyperjump among JJ; predicates, a straight- 
forward argument can be given, which is sketched in Feferman’s contribu- 
tion, 


THEOREM 11. For any n, ID<° proves P,4,(n) if and only if (IIj-CA)<” 
proves ne O, 


This theorem 11, combined with theorems 5 and 6 give relationships be- 
tween 2}-DC and ID<"; in particular, that these theories have the same 
‘provable ordinals’. 

When the w-rule is added to the theories 2;-AC, analogous results may 
be obtained. In the theorem, the theories (17;,-CA)*®! play the role of 
(11;-CA)**, where q, is the first nonrecursive ordinal. 
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ASSIGNMENT OF ORDINALS TO TERMS FOR PRIMITIVE 
RECURSIVE FUNCTIONALS OF FINITE TYPE* 


W. A. HOWARD 


Introduction. Gentzen [2] showed that the consistency of first order 
(classical or intuitionistic) arithmetic can be proved by methods which are 
finitistic except for the use of the descending chain principle for the ordinals 
less than é9. On the other hand, Gédel [4] gave an interpretation of first 
order intuitionistic arithmetic # in a quantifier-free theory 7 of primitive 
recursive functionals of finite type, thereby reducing the consistency of # 
to that of 7. 

In the following, the terms of 7 will be given in the J-calculus provided 
with a finite type structure. We establish a direct connection between 7 
and é by giving an assignment of ordinals less than eg to the terms of 7 
with the property that the reduction of a term (in the sense of j-conversion) 
lowers the corresponding ordinal. Unfortunately we have been able to do 
this only for restricted reductions; i.e., reductions arising essentially from 
the contraction of closed subterms (§ 1). However, in §4 we extend our 
result to the case of arbitrary reductions by the use of non-unique assign- 
ments of ordinals to terms. This allows us to assign to every reduction 
sequence of terms 


Mingted 4 (eGe med 4 neG «a5 


a corresponding descending sequence of ordinals 


Cy Sa aaa 


If the ordinals less than ¢9 and the terms of Y are enumerated in a 
natural way, our discussion can be formalized in Skolem (free variable, 
first order, primitive recursive) arithmetic. Thus the ‘computability’ of the 


* Mention should be made of the paper of L. E. Sanchis, Functionals defined by recursion, 
Notre Dame J. Formal Logic 8 (1967) 161-174. 
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terms of 7 in the sense of Tait [5], namely the reducibility of each term of 
ZF to normal form, follows from the descending chain principle for the 
ordinals less than ¢). Tait uses combinator-terms rather than A-terms but 
it is easy to model combinators in the system of A-terms. 

Our treatment is developed in a rather general form because we intend 
in the future to extend our results to certain extensions of 7: in particular 
to type-zero-bar-recursive functionals. For extensions of 7, the ordinals 
less than €9 may no longer be adequate. Indeed, we shall show in the future 
that for the analysis of the extension of ZY just mentioned, the appropriate 
ordinals are those less than Bachmann’s ordinal @,(0), c = &94, (as 
developed in H. Gerber’s paper [3]). 


1. Terms. The terms of Y are obtained from the prime terms of 7 by 
applying the following operations finitely often. 

(i). Application: from terms A and B get AB, with the restriction on the 
pair A, B given in clause (a) under Type levels below. The interpretation 
of AB is the value of the functional A when applied to the argument B. 

(ii). A-abstraction: from any variable X and term A get AX.A. 


Ty pe levels. The general situation to which the method of the present paper 
applies is: one has a system of terms, generated as just described, in which 
it is possible to assign to each term A a non-negative integer level(A) in 
such a way that whenever AB is well-formed, level(A) > level(B) and 
level(A) > level(AB). In the case of F this assignment is obtained by 
first assigning to each term A a type symbol, called the type of A, as follows. 
Type symbols are generated from a prime type symbol 0 by means of the 
operation: from type symbols ¢ and t get the type symbol (c)t. After type 
symbols are assigned to the prime terms of 7 (see Prime terms below) 
the assignment of type symbols to the remaining terms of 7 is determined 
by the following two clauses: 

(a). If A has type (o)t and B has type o, then AB is well-formed and 
has type Tt. 

(b). If the variable X has type o and A has type t, then 1X.A is well- 
formed and has type (c)t. 

The /Jevel of a type symbol is defined inductively by the following two 
clauses: 

(i). The level of the type symbol 0 is zero. 

(ii). The level of (a)t is the maximum of 1 +level(c) and level(t). 
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It is easily seen that every type symbol o other than 0 has the form 
(o,)...(¢,)0 and that the level of o is the maximum of 


1+level(o,),..., 1+level(a,). 


The level of a term A is defined to be the level of the type symbol assigned 
to A. : 
The length of a term is defined by the following clauses: 
(i). Prime terms have length 1. 
(ii). The length of AB is length(A)+length(B). 
(iii). The length of 2¥.A is 1+length(A). 


Prime terms. The prime terms of 7 are as follows: 

(i). Variables of every type. 

(ii). A numeral n of type 0 corresponding to each non-negative integer n. 

(iii). A constant 3 of type (0)0 for the successor function. 

(iv). For each type symbol o, a constant R of type (0)((0)(a)o)(c)o for 
a ‘primitive recursion functional’. 

(v). For each type symbol o and non-negative integer n, a constant R” 
of type ((0)(¢)c)(c)o for a ‘restricted primitive recursion functional’. 


Freeand bound variables. In the generation of terms by the two processes 
of application and /-abstraction, the (possibly vacuous) occurrences of 
a variable Y in a term are said to be free until the stage is reached where 
the operation 1X is applied; thus the free occurrences of XY in a term A 
become bound occurrences in 7X.A. A term is said to be closed if it has 
no non-vacuous occurrences of a free variable. 


Subform and subterm. In order to define the notion of subform, let |B| 
denote the set of subforms of a term B. Then 

(i). |B] = {B} if Bis prime. 

(ii). [AB] = |A| U |B] U {AB}. 

(iti). [AXA] = |A] U {AX. A}. 
The notion of subterm is defined similarly except that clause (iii) (where 
|A| now stands for the set of subterms of A) is replaced by the clause 

(iti)*. [AX. Al = ([A] U (AX. A})—-Z, 


where & denotes the set of all those subterms of A which contain a (non- 
vacuous) free occurrence of X. 
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Notation. For terms A, B and C, ABC denotes (AB)C. More generally, 
A,A,A3...A, denotes (...(4,A2)A3.--)A,. The result of substituting 
a term B for all free occurrences of a variabie Y in A is denoted by [B/X]A. 


Contractions. In the following schemata, A contr B means: A contracts 
into B. 
(i). A-contraction: (AX.A)B contr [B/X]A so long as no free occur- 
rence of a variable in B becomes bound in [B/X JA. 
(ii). $n contr n+1. 
(iii). Rn contr R®. 
(iva). R°HG contr G. 
(ivb). R"*'HG contr Hn(R"HG). 


Restricted reductions. A red B if B arises from A by contracting one 
occurrence of a subferm of A. 


General reductions. A red B if B arises from A by contracting one occur- 
rence of a subform of A. 


Motivation for our formulation. In case our formulation of the terms of 
ZF appears peculiar, the following motivation should be borne in mind. 
As mentioned in the introduction, the results of the present paper, together 
with the descending chain principle for the ordinals less than &9, yield the 
computability of the terms of 7; i.e., the reducibility of each term A to an 
irreducible term: the so-called normal form of A. Thus in the present paper 
we are concerned with an analysis of the terms of Y, whereas F as a 
formal system contains equations between terms and propositional com- 
binations of such equations. Once the computability of the closed terms of 
ZF has been established, then intensional equality can be introduced: two 
terms are said to be intensionally equal if they have normal forms which are 
congruent (i.e., the same except for changes of bound variables). For this 
purpose it is necessary to be assured that two normal forms of the same 
term are congruent. Tait [5] achieves this by imposing a rule which uniquely 
determines for each term A the subterm of A that is allowed to be contracted 
in reducing A. For more general reduction procedures, the uniqueness 
(up to congruence) of the normal form of a term is assured by a well-known 
theorem of Church and Rosser. 

The notion of intensional equality, just described, provides a truth 
valuation for the closed formulae of 7 (i.e. propositional combinations of 
equations between closed terms), Thus one gets a proof-theoretic analysis 
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of the system 7° obtained by restricting 7 to closed formulae. Finally 
one gets a proof-theoretic analysis of 7 itself from the observation that a 
proof in 7 can be transformed into a proof in 7° by replacing all free 
variables by suitable constants, starting from the end of the proof and 
working back, the induction rule in 7 being eliminated by the calculation 
of numerical constants. 

In the discussion of Z (and its extensions), we permit any methods 
formalizable in Skolem (free variable, first order, primitive recursive) 
arithmetic. Hence sets of axioms of the form A contr B are permissible so 
long as the number of B (in a natural enumeration of terms) is a primitive 
recursive function of the number of A. 

In view of the motivation just described, the contractions given above 
for the constants R and R” are sufficient; i.e., it is not necessary to require 
the stronger contraction Ax. R(8x)HG contr Ax. Hx(RxHG). 


2. A theory & of expressions 


Introduction. In the present section we introduce a theory & whose objects 
are expressions. Expressions are generated from constants and from variables 
x, by two operations: from expressions f and g obtain f+g and (f, g). By 
a vector of level n is meant an n+1-tuple h = <ho,...,/,> of expressions 
h;. We introduce operations 0 and 6” which produce vectors f 0 g and 6'f 
from vectors f, g. 

The motivation of the present section is as follows. In §3 we shall define 
a mapping from the terms of 7 into vectors which assigns to each term of 
type level 1 a vector of the same level. In particular, to the variable X” of 
type level 1 is assigned the vector x” = <(xo,..., X,>, it being assumed that 
the variables of Z have been enumerated: X°, X',..., X",.... More 
generally, the presence of a free variable X” in a term H is reflected by the 
presence of the variables x$,..., x, in some of the components A; of the 
vector h assigned to H. It is crucial, however, that x} is not contained in 
h, for k >j (i.e. h belongs to the class C defined in the present section). 
If f and g are assigned to F and G, respectively, then to FG is assigned the 
vector h, of the proper level, whose components are equal to the corre- 
sponding components of f 0 g. If h is assigned to H, then 6°h is assigned to 
2X". H, Thus 6” is a kind of ‘abstraction’ operator which maps a vector h 
which contains the variables xj into another vector 6’h which does not 
contain the variables x}. 
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As will be mentioned in the present section, the theory & of expressions 
has interpretations in which the constants are ordinals and the variables 
range over a set of ordinals (or ordinal notations). In such interpretations 
an expression is interpreted as a function (more precisely: intensional 
function) of the variables that it contains. 

In §3 by the expression assigned to H is meant the initial component hg 
of the vector h assigned to H. When expressions are interpreted by means 
of ordinals the expression hg becomes an ordinal if A is closed, and this 
is taken to be the ordinal assigned to A. 


The theory &. We construct expressions from: constants, variables x’, 
the symbol +, and the symbol (-, -) as follows: 
(i). A constant is an expression. 

(ii). A variable x} is an expression. 

(iii). If f and g are expressions, so are f+ g and (J, g). 

It is assumed that among the constants of & there are three constants 
denoted by 0, 1 and o. 

The theory @ is an axiomatic theory of a relation < between expressions. 
Equality between expressions is treated axiomatically: it is assumed to be - 
reflexive and to obey the replacement axiom. f> g means g <f. fxg 
means: f < g or f= g. 


Axioms for & 
2.1. Iff < g and g <A, then f < h. 
2,2, lt f= g, theny 90. 
23. fargo = Gee fag hs jema)r 
2.4. If f < g, then f+h < g+h. 
2.5. f+g =fif and only if g = 0. 
2.6.0<f; 0<1 <a. 
2.7. If f < w and g < w, thenf+g <a. 
2.8. (£gt+h) = (f.9)+G4). 
2.9. If g < cand h <c, then (g,f)+(A,f) < (c.f). 
2.10. If f < g, then (h, f) < (A, g). 
2.11. Iff < g and h # 0, then (f, h) < (g, A). 
C10 f) A 
2.13. (f, (9, h)) = (+9, h). 
From 2.5 and 2.8 it is easy to prove: (f,0) = 0. From 2.4, 2.5 and 2.9 
it is easy to prove 
2.14. If g > 0 and h > 0, then (g, f)+(h, f) < (g+h,f). 
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Interpretation of &. For the analysis of primitive recursive functionals in 
§ 3, the following interpretation of & is used. The variables range over 
ordinals (or ordinal notations) less than ¢); expressions are interpreted 
as functions, of the variables contained in them, in a manner to be described 
presently; a < 5 is interpreted as meaning: a < b (the ordinary ordering of 
the ordinals) for all values of the variables. a+b is interpreted as the natural 
(i.e. Hessenberg) sum a+b which is defined as follows (Bachmann [1]): 

Represent a and 6 in Cantor normal form a = wo'+...+@™ and 
Po ec ewierea = ee aand pb, >... = b_> then 


Ea p=] oO +o, +a ™, 


where the sequence c, >... > C,4, 1S a rearrangement of the sequence 
eC, Oe tinstes Ope 

To define (a, 5) in the case b # 0, represent 6 in Cantor normal form to 
the base 2: b= 2°'+... +2", where b; >... > 5,. Then take (a,b) 
to be 27+ ...2™, where ec, =a +b, (1 <i <n). Finally take (a, 0) 
to be 0. 

It is easy to see that this interpretation satisfies the axioms 2.1-2.13. 


Expression vectors. If fo,...,f, are expressions, the n+Il-tuple f = 
= <fo,.-+>f,> is called a vector of level n; and for 0 < i < nthe expression 
f, is called the ith component of f. We also use (f); to denote the ith com- 
ponent of f. If i > level(f) then (f); is defined to be 0. We shall often write 
f, for (f);. We define f+g to be the vector h of level max {level(f), level(g)} 
such that h; = f;+g; (0 < i < level(h)). 


The operation fO g. Let n denote max {level(f), level(g)}. Then fO g is 
defined to be the vector h = <ho,..., 4,> such that 


hy =Sn+9ns 
h, = (Aisi, fit9:) forO<i<n. 


Clearly fO g = g Uf, by axiom 2.3. 
The following four lemmas are easy to prove from axioms 2.1-2.13 by 
downward induction on i. 


Lemma 2.1. If f, > O for alli < n, then (fg); > Ofor alli < n. 
LEMMA 2.2. (f0 g); =f; for all i. 
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LemMMA 2.3. Assume level(f) = level(g) =n, and f; = g; for O<i<n. 
Then (fO h); = (g Oh); for all i. 


LEMMA 2.4. Under the assumption of lemma 2.3 and the additional assumption 
fi>9; for 0<i<k, some k <n, we have: (fOh);>(gOh); for 
Ok, 


We now prove: 


LEMMA 2.5. Assume level(f) = level(g) = n+1 > level(h) and f, > 0, 
g; > Ofor0 <i < n+l. Then 


(fOh);+(9 0h); < (f+9) oh) forO <is n+l. 

ProoF. The proof is by downward induction on i. The lemma is clearly 
true for i = n+1. To obtain the induction step, let a, b and c; denote 
fOh, gOh and f;+g9;+h;, respectively. Observe that by lemma 2.1 
a;4, > Oand b;,, > 0 for all i < n+1. Hence by 2.14 

(i415 i) +(Dists C) X (Gin + is 1s Ci): 
It is easy to see that 

(f Dh); +(9 OA); XK (4:41, 6) +(di41, &), 
whereas by induction hypothesis 


Gait bins X< (S49) 9 Dist, 
so cs 


(4:41 +bi41, 1) < (ft+g) 9 h);. 
Notation. kf denotes f+ ... +f (kK summands). 
LEMMA 2.6. Let the assumptions be as in lemma 2.5. Let d be a vector such that 
2fnt1t2Gn+1 < G41 and f;+g9; < d; for alli < n. Then 
2((foh)o(goh));< (doh), foralli< n+l. 


Proor. Let a, b and c; denote fOh, g Oh and f;+9;+h;, respectively. 
Let e denote a0 b. The induction hypothesis is 2e;,,; < (dO h);,, for a 
given i < n+1. We must prove 2e; < (d 0 h);. Observe that 


a, +b; = (Gis. .Si th) +(bia1, gi+hi)- 
Hence 
a+b; < (a;445 6) +(bi41> &)- 
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But e; = (e€;.,,4;+5;). Hence by use of the axioms 2.10, 2.8 and 2.13 
we conclude 
Ca (Cities > ci) +(Ci41 + bi 41 eer): 


By lemma 2.1 5,,, > 0. Also e;4, 2 @;4,+5;,,. Hence e;4, > a;4;. 
Similarly e;4, > 5;4,. Hence e; < (2e;,;,c¢;) by axiom 2.9. Hence 
2e; < ((d0 k);41,¢;) by the induction hypothesis and axiom 2.9. The 
desired result 2e; < (d 0 h); now follows from c; < d;+h;. 


The classes C; and C, Recall that the variables occurring in expressions are 
taken from a list of variables x}. We now define the classes C; of expressions 
by four clauses. 

Starting clauses: 

(i). If the expression / contains no variables, then A is in C;. 

(ii). For every r, the variable x; is in C;. 

Inductive clauses: 

(iii). If f and g are in C,, then so is f+. 

(iv); lf fis in €,,, and g isin C,, then (/, g) is in C,. 
A crucial property of expressions in C; is given by the following lemma. 


LemMA 2.7. If h is in C, then h contains no variable x’; such that j < i. 
PRooF. By induction on the number of applications of clauses (iii) and 
(iv) in the definition of the classes C;. 


The class C is defined to consist of all vectors A such that (A), is in C; 
(0 <i < level(h)). 


LEMMA 2.8. If f and g are in C, then soisfQ g. 
Proor. Immediate from clauses (iii) and (iv) in the definition of the 
classes C;. 


Notation. [e/x5]h denotes the result of substituting e for all occurrences of 
x; in the expression h. 


Lemma 2.9. Suppose e is in C;. Then the operation of substituting e for x} 
transforms each class C;, into itself (i = 0,1, 2,...). 

Proor. We must prove the assertion: for all i and all 4, if / is in C,; then 
[e/x)]h is in C;. It is easy to verify this assertion if h is as in clauses (i) or 
(ii), and then prove the assertion by induction on the number of applications 
of clauses (iii) and (iv) in the formation of h. 


452 W. A. HOWARD F XXVIII 


The operation 6’. It will now be assumed that by means of a function n(r) 
we have obtained a list of vector variables x’ = <x4,..., Ai by, 
(r= 0,1,2,...). To each Ain Uc; we associate a vector 6" in C, such that 
6°h has level n(r)+1 and does not contain any component of x’, as follows. 
In order to avoid ambiguity in clause (a), below, we shall regard h in C, 
as being completely given only when the particular C;, to which / belongs, 
is specified. 

Starting clauses: 

(a). If 4 is in C; and contains no component of x’, then 6h is the vector 
of level n(r) such that (6"A);=A+1 and (6"2);=1 when j #i, 
0<j<n(r)+1. 

(b). If # is x}, then (Oh), = 1 (0 < jf < a(r)+1). 

Inductive clauses: 

(c). If A contains a component of x” and h = f+g, where fand g are in 
C;, then d°h = o'f+6'g. 

(d). If # contains a component of x’ and A = (f, g), where f is in C,4, 
and g is in C;, then 

(Oh); = (Of), +(60'9), if O<j < n(r), 
and 
(Oh); = 2(6°f),+2(6'9); +1 if j = n(r)+1. 
We also define 6” as acting on vectors h = <hy,...,h,> in C as follows: 


(Oh); = (Oho), +... +(0h,); if O< j < n(r)+1, 
and 
(Ch); =h,;+1 if n(r)t+i <j < p. 


By lemma 2.7 6’h contains no component of x’. 


LEMMA 2.10, Suppose e is in C; and contains no component of x’. Suppose 
s # r. Then for any h in \) C, 


O'[elxi]h = [e/x5]o"h. 


Proor. By induction on the number of applications of clauses (iii) and 
(iv) in the definition of 6”. 


COROLLARY. Suppose e is in C and contains no component of x". Suppose 
s #1r. Then for any h inC 


O'[e/x*]h = [e/x*]o°h. 


(We assume level(x*) = level(e).) 
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LEMMA 2.11. Let e be a vector of level n(r) and assume h is in C;. Then 
((O°A) Ge); > [e/x" A. 

Proor. By induction on the number of applications of clauses (iii) and 
(iv) in the definition of 5’. Clause (iii) is handled by lemma 2.5. Clause 
(iv) is handled by lemma 2.6. Namely, suppose h = (f, 9), where f is in 
C;+, and g isin C;. Denote (6’f) 0 e and (6’g) C e by u and », respectively. 
Then ((6°A) 0 e); > (uO v); by lemma 2.6. But (uO v); > (uj41,0;)) > (69); 
since u;,, > f and v; > g by induction hypothesis. 


Coroiary. If h is in C and e has level n(r), then ((d"h) 0 e); > ([e/x"Jh); 
for alli < level(h). 


LeMMA 2.12. Let a, b and h be expressions. Suppose a> b. Then 
[alxt]h > [b/xi]h. 

Proor. By induction on the number of times the operations (iii) under 
Expressions above are used in building up A. 


LemMA 2.13. Let a, b and h be expressions. Suppose a > b. Suppose further 
that h is in Cg and that xo is contained in h non-vacuously. Then 
[a/xo]h > [b/xo]h. 

ProoF, By induction on the number of times the operations (iii) and (iv) 
under The classes C; and C above are used in building up A. In particular, 
when his (f, g), where fe C, and g € Cy, we knowby lemma 2.7 that x is 
contained non-vacuously in g; so axiom 2.10 can be applied. 


3. Assignment to terms of 7. The main discussion in this and the following 
section applies not only to 7 but to any extension YW of ZF. Of course 
we treat only the contractions given in § |; i.e. A-contractions and the con- 
tractions peculiar to 7. It is assumed that the terms of Y have been 
provided with type levels satisfying the conditions given in §1; namely, 
whenever FG is well-formed, then level(F) > level(G) and level(F) > 
> level(FG). Indeed, more generally, we need make no restriction on the 
formation of terms FG so long as we allow the contraction of FG only 
when the above conditions on the type levels are satisfied. 

If a term H of Y is not of the form FG or AX.F, then H is said to be 
prime. In the present section we shall make assignments of vectors to the 
prime terms of 7 and shall give prescriptions for passing from the assign- 
ment of vectors to F and G to the assignment of vectors to FG and 1X. F. 
Hence for our assignment to be complete it is necessary to have, in addition, 
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a prescription for assigning vectors to the prime terms of &% which are not 
prime terms of 7. For the purpose of this and the following section we 
need only assume that such a prescription exists and satisfies the condition 
that if hk is assigned to the prime term H of level n, then A; > 0 for all 
0S tt 

It is assumed that the variables of &Y have been enumerated: 
X°, X',...,X",..., and that the vector variables x" of §2 have been 
listed in such a way that x" has the same level as X". We shall assign to each 
term Fin Ya vector f in C such that level(f) = level(F). By the expression 
assigned to Fis meant the initial component fo of the vector f assigned to F. 


Assignment to prime terms of 7 

(i). To the variable X” is assigned x’. 

(ii). To the numeral n is assigned <1). 

(iii). To the constant 8 for the successor function is assigned <1, 1. 

(iv). To the primitive recursion constant R of type level n is assigned the 
veeLor < leauge) |) o> Oldevelun. 

(v). To the ‘restricted primitive recursion’ constant R* of type level n 
is assigned the vector <1, ..., 1, 2(A+1)> of level x. 


Terms formed by application. Suppose f and g have been assigned to terms 
F and G, respectively; and that FG is well-formed and has type level n. 
Then to FG is assigned the vector h such that (hk); = (fOg); (0 <i <n). 


Terms formed by i-abstraction. Suppose h has been assigned to the term H. 
Then to AX".H is assigned the vector 6"h, where 6” is the operator defined 
in § 2, 


LEMMA 3.1. Let X° be a free variable of H. Let F be a term of the same type 
as X* and assume no free variable of F becomes bound in [F/X°]H. Suppose 
fand h are vectors assigned to F and H, respectively. Then [f/x*]h is a vector 
assigned to [F/X*]H. 

Proor. By induction on the number of times the operations of application 
and A-abstraction are used in building up H from prime terms. To handle 
A-abstraction, suppose H is AX".G, where r # s. Then 6’ is assigned to H, 
where g is the vector assigned to G. By induction hypothesis [f/x*]g is 
assigned to [F/X*°]G. Hence from the fact that AX’. [F/X°]G is [F/X°]H 
we conclude that 6’ [f/x*]g is assigned to the latter term. But by lemma 2.10 
and corollary é"[f/x*]g = [f/x*]6’g. 
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Addition of constants R’ to 7 . Besides the constants R”, let us introduce a 
constant R”, of the same type, for each expression v. To R’ assign the vector 
<1,..., 1, 2(v+1)> of the appropriate level. 


LEMMA 3.2. Let e and f be the vectors assigned to HO(R’HG) and R°’*' HG, 
respectively. Then e; < f, for 0 < i < level(e). 

Proor. Let n denote level(e); so G, H and R” have levels n,n+1 andn+2, 
respectively. Let a be the vector of level n+1 defined as follows: a,,, = 
=e and @i— (qq li.) ton 0 < 1 < 7. Let b, e and t be the 
vectors assigned to R’H, G and R°HG, respectively. Then for i < n 


€; X (a0t);<(a0(bOc)),. 


Clearly a; < (aOc); for all i < level(a). Hence e; < ((aOc)O (bOc)); 
for all i < n. Hence it is sufficient to prove (a0 c)O(bOc)); < (doc); 
for all i < n+1, where d is the vector assigned to R’*'H. By lemma 2.1 
b; > 0 for i< n+1. Clearly a; > 0 for i< n+1. Hence by lemma 2.6 
it is sufficient to prove 


20 2 Ons A ie (3.1) 
a;+b;<d; for i< n+l. (3.2) 


To prove (3.1) observe 2a,,, = 2(0,1+4,4,) < (1, 1+4,4,) by axiom 2.9; 
and 26,4, < (2v0+3,1+/,4,) by axiom 2.9. Hence 2a,,,+2b,4, < 
< 2(2v+3,1+hy41) < d,41 by axiom 2.9. To prove (3.2) by downward 
induction on i, observe that for i < n, a;+b; is equal to 


(4i41, 1+hj)+ Gi41, 1+A)). 


Hence a;+b; < (@;4,;+6;41, 1+/;) by 2.14. Hence, by induction hypothe- 
sis, a;+b; < (aaa 1+h;) == d;. 


Remark. In lemma 3.2 it would be desirable to replace R’*’ by R” such 
that w > v. The difficulty would then arise, in the present proof, of estab- 
lishing (3.1). This difficulty vanishes if we assume that & has the property 
A(x, y) < (z, y) whenever x < z. This property holds for the interpretation 
of & given in § 2 if, in the definition of (a, b), one uses the Cantor normal 
form to a base greater than or equal to 4 instead of 2. 


THEOREM 3.1. Suppose A contr B, and let a and b be the vectors assigned to 
A and B, respectively. Then a; > b; for alli < level(a). 
ProoF. It is easy to verify theorem 3.1 for the contractions $n contr n+ 1 
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and Rncontr R". The verification of theorem 3.1 for the contractions 
R°HG contr G and R"*'HG contr Hn(R"HG) is provided by lemmas 2.1 
and 3.2, respectively. It remains to examine the contraction (AX°H)F contr 
[F/X*]H. Let f and h be the vectors assigned to F and H, respectively. Then 
[f/x*]h is the vector assigned to [F/X*]H by lemma 3.1. But for all 
i < level(h) the ith component of the vector assigned to (AX*- H)F is 
(sh) of), 


The desired result now follows from the corollary to lemma 2.11. 


THEOREM 3.2. Suppose F reduces to G by contraction of one occurrence of a 
subterm A of F (restricted reduction of F ). Let f and g be the vectors assigned 
to F and G, respectively. Then f, = g; for 1 <i< level(f), and fy > go. 

Proor. It is easy to see that since A is a subterm of F, there is a term H 
such that F is [4/X"]H, where X" has one free occurrence in H. Suppose A 
contracts to B. Then G is [B/X"]H. Let a, b and h be the vectors assigned 
to A, B and H, respectively. By lemma 3.1, f and g are [a/x"]h and [b/x"]h, 
respectively. It is easy to see that the non-vacuous free occurrence of X’ 
in H implies the non-vacuous occurrence of x in ig. Theorem 3.2 now 
follows from theorem 3.1 together with lemmas 2.12 and 2.13. 


Assignment of ordinals to terms. To obtain an assignment of ordinals less 
than ¢) to the terms of Y proceed as follows. First assign expressions f, 
to terms F in the manner described in the present section. Next apply the 
interpretation of expressions given in §2 under Interpretation of &. This 
results in the assignment of a function to F; so apply this function to a 
suitable constant (say 0), in all argument places, to get an ordinal: this is 
the ordinal assigned to F. By theorem 3.2 reduction of F lowers the ordinal 
assigned to F, 


4. General reductions. At first sight it might appear that the assignment 
given in §3 is valid for general reductions; i.e. that theorem 3.2 is true 
even when A is a subform. Namely, one might try to prove this generaliza- 
tion of theorem 3.2 by induction on the length of F. This attempt fails when 
F is of the form AX’.H for the following reason. Suppose AX’. A is 
reduced to AX".G by contracting a subform of H, and let h and g be the 
vectors assigned to H and G, respectively. Then A red G, so by induction 
hypothesis h; > g; for all i < level(h), and ho > go. Unfortunately this 
does not imply (6h); > (6'g); for all i < level(6"h), and (65°h), > (5'g)o 
(at least I can not prove that it does). 
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The above difficulty can be surmounted, if non-unique assignments are 
allowed, as follows. To AX". H we allow the assignment of 6’d+ [e/x"Jh, 
where ¢ is the vector <I, 1,--., 1> of the same level as x’, h is any vector 
assigned to H, and d is any vector assigned to a term D such that D reduces 
to Hein a finitenumberomstepsaD — Dy red D, red ered D, = H. 

The following integer t(F), associated with F and the manner in which 
a vector is assigned to F, is useful for the proof of the remaining results of 
this section: #(F) = 1 if F is prime; 


t(AB) = 1(A)+1(B); 
t(AX".H) = 1+1t(Do)+ ... +4(D,) 


where Dy, ..., D, are described above. 

Lemma 3.1 holds for the new method of assignment: with the help of 
the proof in § 3 it is easy to get a proof for the new assignments by induction 
on (#7). 

Theorem 4.1 below will be proved under the assumption of the following 
axiom for expressions b, f and g: 


4.1. If fg, then [b/xi|f > [b/xi]g. 


Axiom 4.1 clearly holds under the interpretations of the system & used in 
the present paper. 


THEOREM 4.1. Suppose F reduces to G by contraction of a subform (general 
reduction). Let f be a vector assigned to F. Then there exists a vector g 
assigned to G such that f, > g; for 1 < i < level(f), and fy > go. 

Proor. By induction on t(F). We handle A-contractions as follows. 
Suppose F = (AX’.H)B red [B/X"]H, where 6’d+[e/x"]h and b are 
assigned to AX": H and B, respectively. By induction assumption there 
exists an assignment d* to the term D, described above such that d, > dj 
for 1 <i< level(d), and dy > dj. Hence by axiom 4.1 ([b/x"]d); > 
> ((b/x"]d*); for 1 < i < level(d), and ([b/x’]d)) > ([b/x’]d*)). But D, 
is H. Hence by lemma 3.1 [b/x"]d* is an assignment to [B/X"]H. It now 
follows from the corollary to lemma 2.11 that this assignment has the re- 
quired properties. 

The case in which Fis AB, and G arises from F by contraction of a subform 
of A or B, is handled by lemma 2.4. The case in which F is AX". is handled 
by the induction hypothesis and (4.1). 
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5. More efficient assignment. Let a, = w and a4, = wo (k = 1,2,...). 
Let 7, denote the set of terms of Y which contain primitive recursion 
constants R and R” of level at most x. It is easy to see that the assignment of 
§ 3, in assigning ordinals to terms of 7 ,, makes use of the ordinals less than 
@,+1,- However the work of Tait [6], and independent work of C. Parsons 
(private communication to the author), suggests that it is the ordinals less 
than wm, rather than @,,, that should be assigned to the terms of 7,. The 
purpose of this section is to provide such an assignment which works 
so long as the following restrictions are made: if G has type level zero, 
the contractions (AX. F)G contr [G/X]F, R°HG contr G, and R"*!HG 
contr Hn(R"AG) are allowed only when G is a numeral. 

We shall modify the assignment, described in § 3, of vectors to terms so 
that to a term F will be assigned a vector f of level max{1, level(F)}. This 
assignment will have the property that if Fred G (restricted reduction) 
and if g is assigned to G, then f, > g,. The expression /, will be assigned 
to F. When expressions are interpreted by ordinals as in § 3, this will yield 
an assignment to the terms of 7, by ordinals less than @,. 

The modifications are as follows. When r is such that X” has type level 
zero, the corresponding x’ is taken to be <x, x|> rather than <xo> (cf. 
§ 2, The operation 6’). 

To numerals assign the vector <1, 0>. If X” has type level zero, and h is 
assigned to H, then to AX".H assign the vector 6’[0/x{|h. If f and g have 
been assigned to F and G, respectively, then to FG assign the vector A such 
that level(h) = max{1, level(FG)} and h; = (fg); for 0<i< level(h). 

Using the method of §3 and the fact that, for all b, (60 <1, 0); = 
= (bo <1)); for all i, it is easy to verify that the modified assignment has 


the required properties. 
University of Illinois at Chicago circle. 
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ON A NUMBER THEORETIC CHOICE SCHEMA 
AND ITS RELATION TO INDUCTION 


CHARLES PARSONS 


The usual first order number theory Z can be viewed in a number of ways 
as the union of an infinite sequence of subtheories whose axioms and rules 
are of ascending complexity. In this paper we introduce and discuss the 
elementary relations of several hierarchies of subsystems based either on re- 
stricted induction principles or on others of a set theoretic character. The 
main result is an independence theorem for ‘finite axioms of choice’ which 
implies that for every n > 0, the axiom schema of induction with < n nested 
quantifiers is stronger than the rule of induction with < n nested quantifiers. 
This latter theorem is announced in [8] (and for n = 1 in [6]) on the basis 
of a different proof. 

The paper is in the context of a larger proof theoretic investigation 
of subsystems of number theory, begun in [5], of which results are reported 
in [6], [7], and [8]. A paper in preparation will develop the work of [8] and 
give information about these subsystems based on Gédel’s interpretation. 


1. Basic notions. We consider systems whose formulae are in the notation 
of elementary number theory. We use x, y, Z,... for bound variables, a, b, 
c,...for free variables, f, g, h, ... for function symbols. Terms are built 
from 0, S (successor), variables, and certain function symbols in the usual 
way. Weuse r,s, t,... for terms. Formulae are composed from equations 
s = t by the connectives —, A, v, > and the quantifiers Vx and 4x. 

Let Z, be a system which has for each definition of a function @ elementary 
in the Kalmar sense ([3], p. 285) symbols for the functions involved with 
the recursion equations as axioms. Z, has also the usual axioms for equality 
and successor and the rule of induction applied to quantifier free formulae. 
Zo has the deductive apparatus of classical first order logic. 

By well-known arguments we can operate in Zy on formulae with only 
bounded quantifiers as if they were quantifier free formulae; hence if A is 
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any such formula, by the rule of induction the formula 
IA ADAVx <a (Ax > A(Sx)). > Aa 


is provable in Z). IA implies the usual axiom of induction on A. 
In addition to JA and the rule of induction 
AO Aa > A(Sa) 
Aa 


IR 


we consider some other schemata suggested by the fact that elementary num- 
ber theory can be derived in the theory of finite sets: 


FAC ere Mea) iy, bela = fine ances hee Ea ADE 
AS dyvx <a [(y), = OA Ax. v.(y), = 1A AX] 
M Vx < a dyAxy > JwV¥x < a dy < w Axy 


((y), is the xth exponent of y; [3], p. 230.) FAC is a sort of choice axiom; 
AS is a version of the Aussonderungsaxiom; M (for maximum) is related 
to the axiom of replacement. 

If Sis one of our schemata, let S,, be S restricted to formulae A with < n 
nested quantifiers; let S, be S restricted to A with < n nested unbounded 
quantifiers. S?(S}") is S restricted to A which are Z?(J7?). (Hereafter we omit 
the superscript °). S(B) is the instance of S with B in place of A. 

The notation for a schema will be used for the system obtained by adding 
it as an axiom or rule to Z).S > S’ means that every theorem of S’ is 
a theorem of S. 

We now state some simple facts about the relations of these schemata. 


Lemma 1. In FAC?,,, every formula with < n nested unbounded quantifiers 
is equivalent to a X,,4, formula, 
Proor. FAC?,, implies, for any A with < n nested quantifiers, the for- 
mulae 
Vx < a dyAxy = AyVx < a Afx, (y),] (1) 


dx 2 a VyAxy = VyAx < a A[x, (y),]. (2) 


If B has < n nested unbounded quantifiers, it is equivalent to a prenex for- 
mula which, except for bounded quantifiers, is 2,,,. Starting from inside, 
these bounded quantifiers can be transposed inward by (1) and (2); when one 
reaches those outside the outermost unbounded quantifier, one is operating 
ona 2,4, formula. 
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LemMA 2. (a). FAC? « FAC?,, © FAC; 
(b). FAC” = AS” «> AS’ a IA} 
(Cues uA 
(d). IR, IA; 
(ce). IAZ,, > FAC? 
(f» TR, ise AS; 
Proor. (a). The right-left arrows are obvious. If Vx < a Jydz Bxyz 


where B is J, then, if v is a pairing function with inverses v’, v, we infer 
Vx < ay B(x, v'y, vy); by FAC? there is a y such that 


eS OBE). Tay). 


vi(y)i 


Then clearly if y’ = [ica D; 
Nx = Gc FX, () ele 
FAC?,, > FAC, is immediate from lemma 1. 
(b). AS(A) can be proved by applying FAC to 
y=OAAx. V. y= 1A AX; 


by (a), if A is IT, this can be done in FAC®. 
We postpone proving AS” > AS}. Sinse AS(A) can be proved by IA on 


ely [fm ve I ORE Ge le Zeal 


iga 


IA, — AS, is evident. Conversely if 
Vx <a [p= OA Ax. v.(), = 1A AZ] (3) 
then the induction axiom (provable in Zy) 


(V)o = OAVx <a [(y), = 0 > (y)sx = 0]. > (vy), = 0 


implies IA(A), and since by the substitution restriction in AS Aa cannot 
contain free y, the existential quantification of (3) by y implies IA(A). 

(c). Trivial. 

(d). IA, + IR;, follows from the standard reduction of IR to IA; IR}, > 
IA, by the proof given above for the case n = 0. 

(e). FAC(A) can be proved with the induction axiom 


dyvx < 0 A[x, (vy), JAVz <a {AyVx < z A[x, (y),] 
= AyVxeceSz A[x,i(y),.]} aeeapvx era Alx, (y),]. 
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We now prove AS? > AS/. AS” = AS? is obvious. Hence we have 
AGE SS UNGE Se = TENCE 


If Aa has < n nested unbounded quantifiers, it is composed by propositional 
connectives and bounded quantification from formulae JwBxw and VwCaw, 
where Baw and Caw have <n—1 unbounded quantifiers. By (a) and lemma 1 
A is equivalent to a formula A’ composed by connectives and bounded 
quantifiers from ZH, formulae. But by the formulae 


dyvx < a [(y), = 0 = (b), # 0] 
dyVx < a [(y), =0 = (6), = OA(c), = 0] 
dyvx < a [(y), =0 = Vz <t ((b)ys,2 = OD] 


Ml 


the A for which AS(A) is provable in AS? are closed under logical connec- 
tives and bounded quantifiers. 

(f). If Aa is WwBaw where Baw is Z,.,, we can prove AS” by the induc- 
tion rule on the formula 


dzdyVx < @ Vwi), = OA Bxw (yy = UA BOatz)e)] 


which is equivalent to a 2,,, formula. This proves (f). 


Lemma 3. (a) ',, FAC(A) > M(A) 
(b). t,, M(A)AVa AS(4[v' a, v'a].) > FAC(A) 
Proor. (a). If the antecedent of M(A) holds, by FAC(A), for some y, 
Vx < a A(x, (y),). Then w = max, <, (y), satisfies the consequent of M(A). 
(b). Assume M(A) and Vx < a dy Axy. Then for some w 


Vx < a dy < w Axy, 
and by logic 
Va AS(A[v'a, v?a]) > dyVx < max v(x, y) 
< 


x<a 
ysw 


[Oye OA AG a ee ey xo) 


If y satisfies the consequent of this, y’ = []j<. p; exp HZ < w ((Y)yG,2) = 0) 
satisfies Vi < a A[i, (y’),]. 


From lemma 3 we have FAC, < AS,+M,. Hence the relations of the 
systems can be diagrammed as follows: 
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Wel NO IY NOY PN Oe eS eI 


TO eg _—_—ai—ai— a ET 


Sager 


It is natural to ask whether any of the relations in the hexagonal part of the 
diagram holds in the reverse direction, and what the relation is between 
FAC/ and IR, 4. The main result below is that FAC” is not derivable even in 
IR”,,. It follows that IR,,, ~ IA?,, is false and that AS? — FAC? is 
false. IA, # IR,., follows from the fact that IR,,, suffices to prove the 
consistency of IA, (cf. [8]). It is easy to see that FAC) # IA{; we conjecture 
that the same holds for higher x. 


2. Independence results for FAC”. Since FAC” implies IA,,, it is clear that 
FAC? cannot be derived in IR,. We strengthen this observation by showing 
that it is not derivable in IR”, ,. 


THEOREM |. For every n there is an instance of FAC! which is not derivable 
in Zo from true IT,,,2 sentences. 
Proor. By lemma 2 it suffices to show that there is such an instance of M 
with A Z,,,- 
Let JyAxy be a complete 2, predicate. Let T;“eaxy be a formula which 
expresses the following: 
y is the number of the least derivation (say in Kleene’s formalism 
of recursive functions ([{3], ch. XI)) from the equations with number 
e and equations of the form g(k) = I, of an equation f(Z, X) = p (for 
some p < y), and JuVzVw < yVk < y VI < y, if w is a line of the 
derivation of the form g(k) = I, then 


l= 0aA[k, (u),].v.) = 1a Afk, z]. 


It is clear that the predicate preceding the quantifiers is elementary and is ex- 
pressed in Z, by a quantifier-free formula, and the predicate inside brackets 
is an (n—1)-quantifier predicate. Hence T;“eaxy can be chosen a 2,44 
formula. Intuitively, it expresses the Kleene 7-predicate relative to dyAxy. 
If n = 0 we omit A and use Kleene’s predicate T, ({3], p. 291). 
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Now we consider the formula 
Vx <a dy T4eaxy > wx < a Jy < w T;“eaxy. (4) 


Suppose (4) is derivable in Z) from I7,,. sentences B, ... B,. Let B; be the 
formula 

VG Sve ce Ve SE eee) ay eee) (5) 
(assuming n+2 = 2/), Expand the formalism to include symbols for the 
Skolem functions for B, ...B,; then (4) is derivable from the formulae 


Bi[x; Pec Ob) X15 1(%1); Pa F(X Seisaacess9 x,)]. (6) 


By Herbrand’s theorem we can derive from (6) and axioms of Zy, by substi- 
tution, propositional logic, and quantifier-free induction (in fact the in- 
duction formula will not contain the f;) an alternation of the form 


@pea < a AN—- T7“(e, a, Oy eA, Co) .V (by < @) 


Vay = Woeaels (ea; by, y) Vv ee 


On Os Come Crise oa ee 


A Tye, 0, Oal€s @, Coss snes Oo sR en Grivel Ue 
Vay < Wile, a, Cova Caer Dos ela lo (end Dy eee 


where the terms 9,(e, a,...) and w,(e, a,...) are composed from the varia- 
bles by the symbols ue and the function symbols of Z). Hence they express 
functions which are recursive in the Skolem functions. The Skolem functions 
satisfy 


yess aT x;) = /) = \)S BY; VX p41 5V 541 ove Vx,4y, 
Bix, pee) X1.Fi(%1), ce oa tas bat 28 cw Vio ee) yil 


which, since n+2 = 2/, is a X, predicate. Hence the functions Hi are re- 
cursive indy Axy. Hence so are the functions @; and w;. 
We set 
cj = nyT? “Le, a, 9,(e, a, Co> os Cj-4 3 bo, oa 04 Open) y| 
6; ="uz<ga — Jy < We, a, 09,..- Ej-1, bee Dan) T; “(e: a; Za) 
Then we define 
pe, a, i) = Wole,a)+1 if i= io = @of(e, a) 
sail. 1(€, Go, ae aD ae b,)+1 if i= Tjy41> 
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where 


ipea = Pj+ile, a; Connats ip Dig 0 oe b;) 
if P;+1(2, a, Gps on ay oie paneer’ bj) # igor Le 
anes (hese Towa sl Omer Wise. 


By the recursion theorem there is an e which defines Aeag(e, a, i) recursi- 
vely from R4yAxy. 

Then certainly e(a, 7) is defined for i = ip; hence there is a y satisfying 
Ty “(e, a, @oea, y); hence é is defined. If €, ..., 2, are defined and j < m, 
then e(a, i;,,) is defined (since i;4, # ip,..., tj, ane stipulations do not 
conflict), and therefore é;,, is defined. By induction, é),..., @,, are all de- 
fined. 

If we substitute our given e and @...2,,6)...5,, in (7), the result 
is false for all a > m. 

Suppose g ea < a. Since U(py TYA(e, a, poea, y)).= Woea+l, 


— dy < Woea is A(e, a, Po &a, y) 


holds; hence by < a; moreover T7(e, a, gg ea, &) is true. 
Suppose the first j alternands false and 9,4 ,(e, a, Go, .. 5 €j, Bo, . -, Bj) 


: a 5 A eS oo 5 
Ee) since ee yisvdcimed i, he, a.@ (¢, a, Cy... €), 095+ +, 0;), C44] 
is true, but 


ple, a, ij+1) > Wj4i(e, a, €o5 Oo Oy) ee oan 6 Oo bj); 


hence pees < aand 


1} ayis Wj +iles a, Co> 56 om Bae 2 605 op) Ta les a, Pons 9 y). 


(6,4; < a depends on the fact that a > m, for by ASSES of induction 
b, = i, < a for every k < j; if ij = 9j+1(€, a, ++) we know 634; < a; 
otherwise since a > m there is m’ < a, m' # ip,...,i;, and the least such 
m’ is i;4,-) Hence the (j+1)st alternand is false. 

From this refutation we can find an instance of one of the formulae (6) 
which is false. I.e. for some i and suitable terms f,,.. ., ¢;, 


ieee (ta) ce tee: ll 


mentalse. Hence 6.15 false, 1.c.not all of By,..., B,, ate true, q.e.d. 

Looked at constructively, the argument shows that if (4) is derivable from 
IT,,+2 sentences (5), we can effectively find a no-counter-example interpre- 
tation of — Byv ...vB,. 
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CoroLLAry. FAC! is not derivable in IR",,; hence for every n there is a 
formula, namely (4) with A a complete 2, predicate, which is derivable in 
JA? ,,, but not in IR, +1 or even in IR",,. 


Proor. Suppose (4) is provable in IR’... Then it follows in Z, from the 
conclusions B,,..., B,, of the last applications of induction in the proof; 


their closures are /T,,,. sentences. By the theorem, we can effectively find 
a no-counter-example interpretation of — B,v...v—8,,, which con- 
tradicts the uniform 1-consistency of IR”, ,. 


Remark. In [4] Kreisel gave an example of a‘non-Herbrand interpretable’ 
theorem of Z. Our argument for n = 0 amounts to showing (4) to be non- 
Herbrand interpretable. A formula is Herbrand interpretable if it has 
a no-counter-example interpretation by functionals explicitly defined from 
function arguments and recursive functions. 

Kreisel’s example is more elementary than it appears since the only axiom 
needed in the proof which goes beyond Z, is a 2, induction axiom. It is na- 
tural to expect such an axiom to be non-Herbrand interpretable (if the induc- 
tion predicate is not recursive), because the Gédel interpretation [2] of 
such an axiom involves the universal recursion functional with values of 
type 0: 

ROG; 92 RUS af —fipoe ans 


which depends on a variable number of values of its function arguments. 

Our example is more elementary since its interpretation involves only 
elementary functions and the course-of-values functional [];<, p/\; but a 
glance at the functional form of (4) (in the sense of [1]) 


Vx < a T,(e, a, x, fx) > Jwiy <w [gw < a > T,(e, a, gw, y)] 


indicates that the functional for w ought to depend on a values of f. 
The Gédel translation of the negative version of an instance of FAC, is 


IWV{Valvx < GAG x) ann <a Ax (Wife, 
which is obviously satisfied by taking Wfa = | |i<a pi®, Then Z,+FAC, 
can obviously be interpreted in a subsystem 7, of T in which recursions are 
restricted to values of type 0, and the only recursion with a parameter of type 


> 0 is that for W. (This system is intermediate between T5 and Ty of [8].) 
In fact this extends to IR? +FAC), and we have 


THEOREM 2. (a). Let PR be free variable primitive recursive arithmetic. Then if 
Vxdy Aaxy (A quantifier-free) is the conclusion of an induction in IR +FACo, 
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there is a constant @ of PR such that 
A(a, x, pax) 


is provable in PR (strengthens theorem | of [6]). 

(b). If B is provable in 1R7+FACo, then it is provable in IR{+FAC,. If 
B is provable in YR} it is provable in YR%. 

(c). IR +FACp is a proper subtheory of 1A; . 

Proor. (a). Let Tj be as above. We suppose it formulated in the ex- 
tensional fashion of [10]. Types are 0 and (c)t for types o, t. Let (0)°0 = 0, 
(0)"** 0 = (0)(0)"0. ((0)0 is the type of number-theoretic functions of m 
arguments.) Terms of 7) are composed from variables and constants (in- 
cluding 0 and S by application; 0 and S are given constants; further constants 
are introduced by the schema 


ED Oly ove 


where ¢ is a term whose free variables are among c,,...,¢,. (Note that if 
s and ¢ are not of type 0, s = tis an abbreviation for sa,...a, = tay... 4,, 
where @,...a, are new variables and sa, ...a, is of type 0.) If t is a term 
with free variables c,,..., ¢, and perhaps a, Aat is represented by the term 
Oc, ...¢,, where @ is defined by 


Vere eecea— 1, 
Constants are also introduced by primitive recursion 
UG ea tiger a. O(SX)a,...d,— Y(Oxd, »..a,)Xa,.~. a, 
Where di,.-, d,, x, @xd,... a, are all of type 0; and by the recursion 
WO) 92) a W(Sx)f = Wf ee 


Let A~ be the negative version of a number theoretic formula A and let 
A’ be the Gédel translation of A into a formula JFVg A,(F, g), where 
A,(F, g) is a formula of the free variable system T. Then if A is a theorem 
of IRJ+FACp and (47) is AFVg A,(F, g) then for some term s of To, 
not containing g, A,(s, g) is provable in 79. 


* Evidently To’ is closed under elementary recursions with parameters of arbitrary 
type. For example if ¢ isa term of type (0)0, I;<, ti is expressed by sa(War), where, 
sax = IJ,<, ():. 
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This is clear for the axioms, and it is well-known that the interpretation 
of logical inferences does not involve recursion (except for a few elementary 
functions to handle definition by cases). Hence it suffices to consider in- 
duction. Let 


VYxdy BOxy YVxdy Baxy > Vxdy B(Sa, x, y) 
Yxdy Baxy 


be an instance of induction. By hypothesis of induction there are constants 
Yo; W, x such that 


BOO, x, Pox) B[a, fax, f(Wfax)] = BlSa, x, xfax] (8) 


are provable in 75. It suffices to find , 0 such that 


Ox = Pox 

p(Sa)x = x(pa)ax (9) 
00ax = x 

6(Sb)ax = v[o(a~Sb), a~Sb, Obax] (10) 


are provable in 7,. For then we can derive by induction 
b <a > Bib, 0(a ~ b)ax, b(O(a + b)ax)] 


which for a = b yields B[a, x, pax]. 

(10) is an ordinary primitive recursion and will be satisfiable if (9) is. 
That (9) will be satisfiable is apparent if we observe that @ and y are ex- 
plicitly definable from primitive recursive functions and the functional W. 
Therefore yf is uniformly elementary in f and finitely many primitive re- 
cursive functions. This suggests 


LemMA 4. T9 is closed under recursions of the form (9) (a, x, pax of type 0). 

Proor. The terms @ox and yfax have ‘A-normal forms’ from which expli- 
citly defined constants have been eliminated as far as possible. Consider 
a term ¢ whose constants are among 0, S, W, W,,..., W, introduced by pri- 
mitive recursion, and others explicitly defined from these. If tf contains no 
free variables of type # 0 except f, then the normal form of f belongs to the 
set U inductively defined by 


(U1). If tis 0 ora variable of type 0, then re U. 


(U2). Iftis St, or ft, or Wit, ...t,, (w, of type (0)"0) and 1t,...4,,€ U, 
then re U. 
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(U3). If b is of type 0, tis Wr(Abs) and r, s € U, then te U. 
The heart of the proof of lemma 4 is finding an evaluation function for the 
termsin U which do not contain f. Let ag, a, ... be all the variables of type 0. 
For a suitable Gédel numbering we find a constant V of Ty such that 


V(piis +... pin, k) = t, (11) 


ifk = gn(t) and a,, ... a;, include all the free variables of t. 

To let V depend on a parameter f and to allow t in (11) to contain f re- 
quires more powerful recursions than are available in 7). For a fixed variable 
x we can define a function ocd such that if d = gn(t(f)) and c = gn(s), 
s, t(f) € U, then ocd is the g.n. of the normal form of ¢(Axs), and we do have 


r T= @) =) V (pes Ons eee Pe ock == t[Ax Vp neg © prin ry fe c)] (12) 


where tc = O expresses ‘c is the g.n. of a term in U without f’, x is a,, and in 
the right-hand product the factor Die is ommitted if i; = k. 


Clearly for any n, Ax@nx is expressed by the nth term in the sequence 


Pox, XPoOX, x(XPo0)Ix, .. .. 


Let up), u,, W2,... be the normal forms of these terms. Then we have a pii- 
mitive recursive g’ such that g’0 = gn(uy), p(Sa) = o(¢’a, ya), where 
y'n is the g.n. of the normal form v(f, 7) of yfnx. Then o’a = gn(u,). If we 
set 

pax = V(p'+ p=, ga) 


(where a; is a), then @ intuitively satisfies (9). In fact, from (11) we have 
immediately | @Ox = up = ox, and from (12) we obtain 


+ p(Sa)x = v(Axgax, a) = x(pa)ax. 
Thus the equations (9) are provable. 


To define V, with each term t € U not containing f we associate a term 7 
with a sole free variable a, which expresses a course-of-values function of 
the function expressed by ¢ in that it will have the property 

OP — & > t= (1)subips< cape (13) 
if a,,...4;, include all the free variables of t. 
i will be explicitly definable from a fixed finite set of functions (cf. [9]). 


Foi example, if with r, s we associate , §(a) respectively, then with Wr(,a,5) 
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we associate w[K, a, 7, §(M(K, a, ¥))], where 


NO ef ee 
=] if r=k 
Mga = [pre i asi) 


rsa 


M(k, a, c) = max N(k, j, 1, a) 
j<a 
lSec 


w(k, a, c,d) = il pj exp [| px exp (4) nex, j,1, 0) 
is 

Then we can easily define a function V which has the property analogous to 
(12) for terms composed from the functions used for the 7, plus Axy x’, 
Axy(x- y), yxpx, Ay(x),, and then V is defined in terms of V, and (13) will 
imply that it satisfies (11) and, with a little more work, (12). 

This proves (a) with PR replaced by 73. ((a) then follows from the fact 
that 75 is a conservative extension of PR in the following sense: 


Lemma 5. Let Ty be the extension of Tj in which the parameters of PR can 
be of arbitrary type (the value must still be of type 0). Let ~ be a constant of - 
Ty of type (0)"0, m > 0. With @ we can associate a constant p of PR such 
that if A is a formula of Ty contaning only such constants and variables only 
of type 0, and A’ is the result of replacing each @ in A by g’, then A is provable 
in T, if and only if A’ is provable in PR. If 9 is a constant of PR then o = ’. 

Proor. We can take PR as J, with the notation so restricted that only 
variables of type 0 are admitted. In both we suppose the axioms and rules 
to be as in [10], pp. 11-12 (with ED in place of PR 1-5; the rule G6 of 
extensionality is redundant in PR). 


Given a constant w of T), we can replace it by anequivalent constant, by 
replacing those recursions PR in its definition tree which involve parameters 
of type # 0 by explicit definitions with the functional R satisfying 


R0af = a R(Sx)af = f(Rxaf)x 
(f of type (0)(0)0); i.e. if @ is defined by PR, ¢, is introduced by 
Oy XO Gy — RX a ee) AK Na ence) 


Ww = Ww, if wy is a constant of PR. Clearly if + A in Tp, then | A, (the result 
of replacing each g in A by ¢,) by a proof involving no recursion axioms 
except those for R and those with parameters of type 0. 
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Let ¢ be a term of JT, thus restricted, of type 0 and with free variables 
only of type 0. Then the normal form of tis in the set U inductively defined 
by (U1), (U2) without ft,, and 

(U3’). If t is Rsu(Axyv) and s, u, ve U, then te U where, as before, 
w,...wW, are introduced by PR with parameters of type 0. 

If @ is defined by PR, then g’ is defined by the same recursion from ’ 
and x’. Thus if we assume as hypothesis of induction that gy’ is defined 
for every @ whose definition involves fewer applications of PR than the max- 
imum for W,...W,, it follows that yj, ...W) are all defined. 0’ = 0 and 
S = S. Incase (Ul)1 —7, in case (U2) 7 is St, or Wj 2, ...2,; in case 
(U3 jl eye care the tree variables of £, we set 


Gi0ey ae, — WC aac. 
O(Syicmeane = 1,(@;, levee. ,) V6, .-.C,; 


let? DG, S'Cn Cy. 

Then if @ (of type (0)"0) is defined by ED and f¢ is in normal form, de- 
fine in PR by Gee... c, =; 

Next we note that if A has no free variables of type # 0 then a proof 
of A in Ty can be transformed into one which does not use the extension- 
ality rule ([10], p. 8) 


Pee Udy oe Uy... 2, (G6) 

P > t(u) = t(v) 
(ae, Of type 0) a, .-. . a, not in P, u,v). For in any proof in 7, we can 
eliminate all applications of G6 except those where t(u) is of the form 
at,...t,, a a variable. (Call a constant @ extensional if, where ga,...a, 
is of type 0, for any P we can derive P > ga,...a, = ob,...b, from 


P->a;=6;,i=1...n, by G6 thus restricted. By induction on the definition 
of g we can show every @ extensional and thus derive the unrestricted G6.) 

If A has no free variables not of type 0, we can substitute zero functionals 
for all variables not of type 0 in the proof below the premisses of the last 
G6, but then these inferences can be eliminated, and by a sequence of such 
steps we can eliminate all applications of G6. 

Given a proof of A in TJ, without G6, we can by pushing substitutions back 
to the axioms obtain a ‘proof’ in which no free variables not of type 0 occur. 
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For each line B, B’ is defined; the rules go over into the corresponding rules 
of PR, and it is trivial that if B is a substitution-case of an axiom, B’ is 
provable in PR. Hence A’ is provable in PR, q.e.d. 

(b). Let Z> be the extension of Zo obtained by allowing introduction 
of constants by arbitrary primitive recursion. From (a) it is obvious that 
IRJ+FAC, is a subsystem of Z>+FAC, and that IR? is a subsystem of 
Zo. To prove (b), it suffices to show that Z>+FAC, is a conservative 
extension of IR}+FAC, and Z} a conservative extension of IR*. This is 
easy to see, using the fact that if @ is primitive recursive, g(a, ...a,)=b 
can be expressed in Z, by 


(Ay)[T.(é, a1... Qn, YA U(y) = 8] 


(eis a p.r. Gédel number of @), and the recursion equations can be proved by 
2,-induction. We omit the details. 


(c). [Ai + IRZ+FAC, is obvious from (b). If Pa expresses in Zo ‘a is 
the Gédel number of a p.r. definition’ then inspection of the representation 
of primitive recursive functions in IR} shows that in IA} we can prove 


Vx <a [Px > Vyaz 7, xyz) > Vx = SaiiPx =) Vydy toy ae 


Suppose every induction axiom in this proof were provable in IR} +FAC. 
Then (since PO > Vy3zT,Oyz is certainly provable) by the closure of IR3 + 
FAC, under I7, induction Vx[Px > Vydz T,xyz] would be a theorem of it, 
But then it would be satisfiable by a primitive recursive function, which is 
impossible. 


TueoreM 3. If A is IT,, and A is provable in 1A,, then A is provable in IR* . 
Proor. Let A be Vxdy Bxy. By theorem 2 of [8] there is a constant of To 
such that B(x, ox) is provable in Ty. By lemma 5, B(x, g’x) is provable in 
PR. But then A is provable in Z> and by the argument of (b) above it is 
provable in IR}. 
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APPLICATIONS OF THE CUT ELIMINATION THEOREM TO 
SOME SUBSYSTEMS OF CLASSICAL ANALYSIS 


W. W. TAIT 


The principal result of this paper is a constructive consistency proof for 
the system (23-ADC) of second order number theory with the 23 axiom of 
dependent choice. The proof will proceed by showing that every derivation 
in this system of an arithmetic formula Y{ can be translated into a normal 
(i.e. cut-free) derivation of %{ in infinitary logic. The crux of the proof is 
the introduction of (constructively) uncountable formulae and derivations, 
using the constructive conception of uncountable well founded trees which 
is discussed in Tait [7]. In preparation for the main result, I will summarize 
some proof theory concerning (Z{-ADC) and its subsystems. This involves 
only countably infinite logic, and it yields sharper results than mere con- 
sistency. Namely it gives bounds on the provable ordinals of the systems 
considered (i.e. the ordinals of decidable well-orderings for which induction 
is derivable). This is possible because of the existence of suitable decidable 
systems of ordinal notations with which the provable ordinals can be 
measured. Takeuti [8] has been able to use his ordinal diagrams to measure 
the provable ordinals of (I1{-CA), i.e. second order number theory with the 
IT;-comprehension axiom; but so far no measure has been found for the 
stronger systems which I will consider. It seems quite possible that some of 
the systems of notations, including higher order ordinal diagrams, which 
have already been developed would suffice; but I have not yet learned how 
to use them. What remarks I do have on further refinements of my results, 
and, more generally, of proof theory, will be saved for the end of the paper. 
Mainly, I will be concerned with the problem of constructive consistency 
proofs, which is, in any case, of a different character than the (non-con- 
structively meaningful) problem of provable ordinals. On the other hand, 
I will give independent consistency proofs for certain subsystems of 
(53-ADC). For, with the discovery of suitable ordinal measures, this would 
lead to lower bounds on the provable ordinals for these systems. 
475 
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1. Second order number theory. I will use a, b, a,,... and x, y, Zz, x1,... 
to denote free and bound numerical variables, resp., and P, QO, P;,... 
and X, Y, Z, X,,... to denote free and bound set variables. The terms are 
built up from free numerical variables, 0, the successor operation ‘, and 
computable function constants. For convenience, I will assume that the 
latter include a constant for each primitive recursive definition. The *- 
predicates are built up from free set variables and constants F, F,,... 
for set valued functions of 0 or more sets. The atomic *-formulae are the 
expressions $ = t, FS, and their negations — $ = t, — F3, where 3 and t 
are terms and F a *-predicate. The *-formulae are built up from the atomic 
*-formulae by means of the operations UvB, WAB, V x(x), A xA(x), 
V XU(X) and A XY(X). Here W(x) (A(X)) is obtained by replacing 5b 
by x (P by X) in a *-formula W(b) (U(P)) which does not contain x (X). 
The negation — % of a non-atomic *-formula 2 is defined in the obvious 
way via De Morgan’s laws and the law of double negation. The logical 
axioms and rules of inference are the usual ones for classical two-sorted 
predicate logic. The non-logical axioms include the equality axioms for 
numbers, the axioms for 0 and ’, the defining axioms for the function 
constants, and the schema %(0)a A x(Q(x) > A(x')) — AxU(x) of 
mathematical induction. These non-logical axioms will be called the first 
order axioms, 

Among the additional axioms which we will consider will be certain 
restricted cases of the axiom schemata of comprehension, choice and 
dependent choice. The comprehension schema is 


CA V ZA x(Zx U(x). 


Here, and elsewhere unless it is otherwise specified, 2{(b) may contain free 
variables other than b, and the same is true for expressions 


UP... OP, beeen: 


Let <a, b> denote a primitive recursive pairing function with inverse 
c = ((€)o, (c),), where the components (c),; are primitive recursive. F*t 
will be an abbreviation for F<3,t>. The choice and dependent choice 
schemata are 


AC AxV ZU(x, Z) > VZA x(x, Z*). 
ADC AOX VY ZU XZ) WWZ(ZM PA APAMZ, Z)). 
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P = Q is an abbreviation for A x(Px @ Qx). If M denotes a class of 
formulae, then M-CA denotes CA with 2[(6) restricted to M; and similarly 
for the other schemata. 

We will use the terms formula and predicate for *-formulae and 
*-predicates which do not contain any set function constants FY, Y,, etc. 
*At and *Z° denote the sets of atomic and arithmetic (i.e. without set 
quantifiers) *-formulae, resp. *Z; is the set of *-formulae without universal 
set quantifiers, and *Z5 is the set of *-formulae all of whose existential set 
quantifiers precede all of its universal set quantifiers. Dropping * all around, 
we obtain the classes At, 2°, Z] and 23 of formulae. For k = | and 2, 
we will also discuss the *A;-comprehension rule 


\ x(x) & = B(x) 


fA, CR 
VZAx(Zx @ U(x)) 


if 2(b) and B(b) are *;, and the *4;-comprehension axiom 
*A,-CA J x(M(x) 3 = B(x) > VZA x(Zx & A(x)) 


if 2(b) and B(b) are *2,;. Dropping * we obtain 4;-CR and 4;-CA. Finally 
we will consider axioms for inductively defined sets of numbers. Let 
(P,Q, b) be a Y°-formula which contains only P, Q and Bb as its free 


variables and does not contain — P. 0 denotes a list of 0 or more set 
variables. Then 2 is called an inductive condition on P. For each such 
inductive condition 1, we introduce a set function constant Ay. The 
axioms for Py are 


rel A x(X(Px(Q), O, x) + Pa(Q)x), 
id A x(U(B, O, x) + B(x) > Ax(Pa(O)x > B(x)), 


for arbitrary *-formulae 6(6). These schemata are collectively denoted by 
Ind. It is important to notice that inductive conditions are 2°; they contain 
neither set function constants nor set quantifiers. 

Besides the first-order axioms, all of the systems which we consider 
contain *At-CA. If the remaining axioms are all instances of the schema S, 
the system will be denoted by (S). Since (S) contains At-CA and — PO is 
atomic, (S) will always contain CA for — (6) when it contains it for 
(5). *1; is the class of formulae whose negations are *2,;. Thus 
(ii,-CA)\i= (72, -CA), 
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It is evident that (*At-CA) © (*2°-CA) = (*4}-CR) & (*Z}-AC) © 
(*Z}-ADC). Also, using *2°-AC, every *Z{ formula can be put in the 
form V Z%(Z), where M(P) is *Z°. This will be called the *{-normal form: 
*Y'NF. From this, it easily follows that (*2{-AC) = (*2°-AC) and 
(*2}-ADC) = (*2°-ADC). These remarks also hold without *; but in 
this case, we get a better normal form. Quantification Vf and /A/f over 
numerical functions can be introduced in the usual way as relativizations 
of VZ and AZ. Using £°-AC every Z} formula can be put in the form 
VFA xU(f(x)), where 2(b) is ° and f(k) is the sequence number of the 
sequence (f(0), . . ., {(K—1)). This is called 2{-normal form: 2{-NF. [1}-NF 
is its negation. 

Ind’ is like Ind, except that the inductive conditions (P, 0, b) are 
restricted to the form %(0, b)v A xPo(b, x), and Ind’2 is restricted to 
%(b) which are in IJ{NF. In (*At-CA, and so in all of the systems we 
consider every instance of Ind’ can be put in I7)NF (i.e. 2{NF preceded 
by universal set quantifiers). 


(Z}-CA) = (Ind’). 


That (Ind’) & (Zj-CA) is easy, since we can define Pu(O)x o 
TRE FORO (Ze 0. x) — Zb) using II{-CA; and this definition satisfies 
Ind’ in (I7{-CA). We will prove the other inclusion: 

If k and m are the numbers of (ko,...,k,-,) and (imo, ..., m,-1) resp., 
then k#m is the number of (Ko, ...,k,-1, Mo, +--+» Mqg-1) and k~n is the 
mumiber of (kg, ,eqnua) eet xO, b) be X° and contain only 0 and b 
frees Eet5(P, 0, y= xO, b)v \ xPb~x. Then % is an inductive condi- 
tion. Let g(a) be the least c such that > Px(O)o(ay~e, or 0 if no suchc 
exists. g is arithmetic in PO) and so it exists in (Ind’). (This uses the 
fact that Y°-CA is contained in (Ind’).) Also + V x(O, y(x)) > PO) 
in (Ind’), where 1 is the number of the empty sequence, and so 
LA FV xQ(O, f(x)) + Py(O)L. Conversely, let ©(b) = AfV x%(O, b*f(x)). 
©(b) is in TI{NF and clearly + Ax(B(, 0, z) > G(z)), so by Ind’ 2 
f PO) — @(1). But C(1) is equivalent to AfV xu(0, f(x)) in (Ind’). 
So we have PO) o AfV xU(0, f(x)) in (Ind’). Now let (4) be in 
TI;NF. Then %,(b) can be put in the form AfV xU(O, (0), f(x)); so 
t Az(Pu(O, {z})1 > o(z))in (Ind’). Hence (by *At-CA) t VZA x(Zxo 
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Yo(x)), ie. 2} NF-CA is contained in (Ind’). But using this schema we 
can derive 5°-AC and using the latter, every 2; formula can be put in normal 
form. This completes the proof that (Ind’) = (Z}-CA). 

It also follows from the above that every 3-formula can be put in *Z} 
form in (Ind’). Thus (43-CR) © (4j-CR+Ind’) and (23-ADC) ¢ 
(*2}-ADC+Ind’). The proof that (Z}-CA) ¢ (Ind’) also shows that 
(Ind) contains the axiom schema of bar induction BI: 


AF V xU(O, f(x) a A x(Q(O, x) > C(x) A Ax(A yE(x7y) > C(x) > C(I). 


For define the inductive condition 8(P, 0, b) as above; we saw that the 
first conjunct in the antecedent of BI implies Ag(Q)1, while the second and 
third imply A_(Q)1 > C(1) by Ind 2. 


§ 2. Countable logic. The countable *-formulae and *-predicates are built up 
as follows: T and — T are *-formulae. The *-formulae are closed under set 
quantification, denumerable disjunction V 9, and denumerable conjunc- 


tion A %,. Each free set variable is a *-predicate. If A has n argument 


Piccsmand hippy. ate predicates, then so is P(F,,...,F,). If 
WM, W,,... are *-formulae which contain no quantifiers, then 2x, is a 
*-predicate. If F is a *-predicate, then Fn and — Fn are *-formulae for 
each n. We will identify Ax%U,n with %, and — Ax%U,n with — Y,,. The 
countable formulae and predicates are the *-formulae and *-predicates which 
contain no set function constants. The atomic (*-)formulae are those of 


the forms T, Pn, P(F)n, and their negations. 

From now on I will drop the term countable and refer to the (*-)predicates 
and (*-)formulae of §1 as finitary. But we will only consider finitary 
(*-)formulae which contain no free numerical variables. Finitary (*-)predi- 
cates and (*-)formulae form a subclass of the (countable) (*-)predicates 
and (*-)formulae, if we make the following identifications: V x(x) = 
VU(n); A x(x) = A Wn); if 3 has the value n, we identify it with the 


numeral n; n = n is identified with T; and for m 4 n, m = nis identified 

The rules of inference for countable logic are most conveniently stated 
for finite sets [, A, [',, etc., of *-formulae. These should be regarded as 
expressing disjunctions of all their members. [+4 denotes the union of 


I and A. The axioms are T (i.e. (T)), Pa+ — Pnand P(F)n+ = PAF )n, 
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for all P, Y, n and lists of *-predicates F. The xormal rules of inference are 
A I+A (if 4 is an axiom) 


V r+%,  (somek) A r+%, (all k) 


r+V 4, P+A%, 
3 1 +U(F) V r+(P) 
r+ V ZU(Z) + AZZ) 


In V, P does not occur in any member of I. In J, F is a *-predicate, called 
the predicate of quantification. Besides the normal rules, we have the cut rule 


r+ r+— 2 
r 


C 


YM and — Y are called the cut formulae. (Since — — 2 = A, this rule is 
symmetric. ) 

Let F = (®o, B1,---) be a set of *-predicates. ¥ is closed if it contains 
the atomic predicates and if whenever %, = G(P), then for every n, G(%,) 
is init. Note that F need not contain any atomic *-predicates. For example 
At = the set of Ax2(x) and 2° = the set of AxB(x), where (0) is finitary 
atomic and %(0) is finitary 2°, are closed. If F is closed, and we will 
assume always that it is, unless specified otherwise, (7) will refer to the 
system with the predicates of quantification restricted to ¥. 

To measure the length of derivations and the complexity of cut formula, 
we introduce a decidable linear system of ordinal notations. These will be 
denoted by u, v, x, y, z, u,, etc., and the ordering relation by <. This is 
to be decidable; and, moreover, the system is to be closed under the function 
x(a), where y°(u) = 2" and for z > 0, x7(u) is the (u+1)st fixed point o1 
all the y* with x < z. xy is of course to be computable on the notations. 
Such systems of notations are well-known. Let < denote a well-founded 
partial ordering of a, B, y,.... B < z (B is of rank < z) is inductively 
defined to mean that, for each « < B there is an x <z with ax x. 
B < z means that 6 < x for some x < z. The class of *-formulae and the 
class of derivations can be regarded as wellfounded partial orderings, 
built up from the atomic *-formula and from the instances of A, resp.. 
So the rank relation makes sense for both of these classes. Note that atomic 
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*-formulae are of rank 0 (i.e. < 0), even in the case PAF a, where F may 
consist of very complex predicates. We say that a derivation is of cut degree 
< v if every cut formula in it is of rank < v. + A[u, v] means that there 
is a derivation of A of rank <u (not < u, as in Tait [6]) and cut degree 
< v. t Af{u] means that there is a normal derivation of A of rank < u. 
eat) sus Ol 
I. If + Wf in (*At-CA) (in *2°-CA), then + WU [w?, k] in (*At) (in (*2°)) 
for some k < w. Similarly without *. 
This is essentially proved by Schiitte [5]. There are two other results which 
form the basis for the present paper. ¥ <z means %, <z for each k. 
Il. Jf + A[u, v] in (F), where F <z and z+v < w+... +@™ then 
er a eee (22)... ) | ie (# ): 
Let < bea decidable well-ordering, and let 


J(c) = Ax(Ay(y < x > Py) > Px) > (A xPx. 


So J(<) expresses well-foundedness. Let |<| < z mean that ¢ is well- 
founded and each n is of rank < z in the ordering c. 

Ill. If + J(<)[xz'(x)] in (F), then |<| < x'(x). 
The proofs of II and III are in Tait [6]. 


THEOREM |. & is the ordinal (i.e. least bound on the provable ordinals) of 
(*At-CA), and ¢,, is the ordinal of (*2°-CA). 


As usual, ¢, = x'(1+z). That ¢9 and ¢,, are bounds on the provable 
ordinals of these systems follows from I-III, since *At < 1 and *2 < o. 
The two parts of the theorem are due to Gentzen [3] and Schiitte [5] resp. 
(Note that (Z°-CA) is simply the lowest level of ramified analysis.) In the 
following sections we consider some further applications of I-III. 


3. Some subsystems of (*Z}-ADC). W(F) is the result of replacing each 
part VZ8(Z) and \ZB(Z) of 2 by V B(F,) and A B(F,) resp., having 
ie k 


first made these substitutions in G(P). 
C4) zie =<azelie. F, <2 for each A) and Ys u. Let 
*F . = *At. For limits z, let *F* enumerate |) *¥,. Let*F,,, enumerate 


x<Z 
the least closed class including *¥, and containing 1x(*F,)(x) for each 
finitary *Z} formula Y(b). It is easy to show that **, < w*. Dropping * 
we obtain the sets F, < w’. 
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Lemma |. If + & in (*Aj-CR), then + U[x°(0)] in some (*F,,). Similarly 
without *. 
The *-less proof is in Tait [6]. The proof with * is just the same. So 


THEOREM 2. The ordinal of (*A{-CR) is x°(0). 
The fact that 7°(0) is the /east bound on the provable ordinals follows 
easily from (6.19) of Feferman [1]. 


THEOREM 3. The ordinal of (*;-AC), (*2}-ADC) and (*4j-CA) is y°(0). 

That this is a bound on the provable ordinals was first proved by 
Friedman. That it is the least bound for (2}-AC) follows easily from (6.19) 
of Feferman [1]. That it is the least bound for (4j-CA) follows from 
Friedman’s [2] result that (5}-ADC) is a conservative extension of (4j-CA) 
for IT; formulae. In Tait [6] I obtained 7°°(0) as a bound for (2j}-AC) by 
showing that every proof in this system of a 2{-formula preceded by universal 
set quantifiers can be translated into a normal derivation of rank < (0) 
in some suitable (¥) with F < e. The proof for (*2{-ADC) is essentially 
the same; but I will give it without the ordinal bounds. 

Let (P,Q) be finitary *2° and ¥ = F(P) a set of *-predicates 
F, = F,(P), k > 0. Set 


H = H(F(P), P) = ax V (,(PF,)A A — UP, Fy) AF, x). 
n k<n 


Then 
t V U(P, F,) > U(P, H). 


Let F be**At. Set G, = F and G, — 4@(G,),G,). The *-predicar. 
G = AxG,),(x), is called the dependent choice predicate for U(P, Q) and 
F relative to F. Substituting G” for P in the above derivation, we have 


t V (G", F,(G")) > 2(G", G’*") (1) 


for all n. Let * F” be defined like *.F, above, except that *F**! enumerates 

the least closed class which includes *¥%* and contains the dependent 

choice predicate relative to *¥* for each *X° formula Y(P, Q) and each 

F in *At. 2° and Y(,) will always denote *-formulae of the form V W(F;,), 
k 


where the , are (not necessarily closed) lists of *-predicates in *F" 
containing all the predicates which occur in %. If MW is *Z{, then 
FY" — W*F™). A” and Aj,, denote sets consisting of *-formulae QI" 
where 2 is in 4. Note that each A” is in AY when u < v; and if F I, + dq) 
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for each n, then there is a single A" with + I,+ A” for all n. Also, if 
Poll Pinna.) + Ae eee sate not in-any formula ef A, and 
G,,...,G, are in *#", then t W(G,,..., G,)+A(o) for some Ao) simply 
by substituting G; for P; throughout the derivation. 


LemMA 2. Let I consist of *-formulae —%S where $8 is an instance of 
*¥°-ADC preceded by 0 or more universal set quantifiers; and let A consist 
Of formulae of the formm | Nawrek VG ae. YX os Xms Visco t,) 
where % is *Z;. Then + [+ A[u] in (*At) implies + A” for some A”. 

Proor. The proof is by induction on u. The only case which does not 
follow immediately from the induction hypothesis is when the last step of 
the derivation is of the form 


P'+At+AXVYMX,Y)  ’'+A+AZ(4Z° = Fv AMZ", Za 

ee SS i a 
where P = I'’+ A XV YU(X, Y) > VZ(Z°-Fa AU(Z", Z")). 

There are normal derivations of the premises Ss rank < v for some 


v <u. By the reduction lemma of Tait [6] [’+4+ V Y2(P, Y)[v] and 
bI’+A+ = P° = Fv V —%(P", P”)[v], where P is a new variable. By 


the induction hypothesis + 4+ V YU(P, Fy), where the Fy are the elements 
k 

of *:¥", and | 4°+ = P° = Fv V 4%(P", P"). Substituting the choice 
predicate G for A and F relative to *¥” for P in these, and applying (1), 
+ At, +20(G", G") for each n and | 4°+ 4 G° = Fv V 4 %(G", G"), 
since G is in *#". We can assume A(,, = A" for each n, and we have 
+t G° = F. So by two cuts t A”. 

It is not hard to see that *#" < w** and to obtain the bound x°(0) 


on the derivation of A“ above. Note that the (*2;-ADC) case of theorem 3 
is an immediate consequence of the lemma (with the bounds). For | %f in 


(*Z;-ADC) = (*2°-ADC) means that | A 6; 2% in At-ADC), where 
i=1 


each ¥, is an instance of (*2°-ADC) preceded by universal quantifiers. 
Le. t {B,,.... 4 B,, W}[eo] in (At). When % is *Z{, we can apply the 
lemma to obtain + Y[7°°(0)] in (*F") for some u, since t 9" implies + A 
in (*F") when A is *Z}. 


4. Uncountable logic; restricted upper bound principle. The species T, of 
well-founded trees of class z are defined by recursion on Z: Tg is the set of 
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natural numbers. For z > 0 T, is inductively defined by: <z,0> ¢T_, and 
if x < z and fis a constructive operation defined on T,, 


V,€1,(fneT,) > <z,x,f> eT,. 


See Tait [7] for a discussion of a constructive conception of such inductive 
definitions. Under Church’s thesis, the well-founded trees can be coded by 
numbers; but of course T, is uncountable (constructively) for z > 0. The 
T, are closely related to the so called constructive higher number classes of 
Kreider—Rodgers [4], but are simpler because we do not require that the 
subtrees of a given tree be simply oidered. m, n, k, m,, etc., will range over 
well-founded trees from now on. 

We extend infinitary logic to uncountable formulae and derivations by 
introducing disjunction V* and conjunction /\” over arbitrary T,. Thus 

n n 


\V’° and A ° are the countable operations V and /\ introduced above. The 
rules of inference are the same as above, with the obvious modification of 
V and A. A formula is of V -class (/\-class) z if x < z whenever V* (/\*) 
occurs in it. It is of class z if it is both of V-class z and of /-class z. 
m én will mean that n is of the form <z, x, f> and m = fk for some ke T,. 
IfneT,, then V and / are of class z. Put 


men men 


Vy = Vm v Va) 


and A" = — V" —. If neT,, then V" and (\” are of class z. 
k k 


The formulae and proofs which will interest us are too big to measure 
with countable ordinals (at least, in the present very general setting.) So 
we will measure them with well-founded trees instead. The rank relation 
is defined just as with ordinals. 

IV. If + A[m, n] in (F), then + A[p] in (F) for some p. 

The proof is exactly the same as in the countable case and is well-known. 
It proceeds by induction on n, and within that by induction on m. In the 
present setting p is of no significance. This is because it is generally un- 
countable, whereas the ordinals of formal systems are < @,. I will return 
to this point below. 

For the remainder of this section, we will be considering only proposi- 
tional logic i.e. the quantifiers Vz and (Az ale dropped. 

Let A(\V") denote the result of replacing some occurrences of V7 in 
formulae of 4 by V” (where neT,). As a consequence of IV, we have 
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Lemna 3. If + A and A is of /\-class z, then + A(V") for some neT,. 
ProoF. This is easily proved by induction on the rank of a normal deriva- 
tion of A, noting that each step of such a derivation is of /\-class z. If the 
last step is by V applied to one of the designated \V/*, then we can assume 
that it has the form 4+, => 4, where k E T,. By the induction hypothesis 
+ A(V ?)+2%,.(V?) for some peT,. The lemma follows by choosing n 
with k and p en, notingthat + 4( V ?) implies t A( V") when p € n. The only 
other crucial case is when the last step is +, (all keT,)=> 
bE P+A*%, = 4. We have | r(V**)+%,(V%") for each keT,, and 
k 


Since 41s of /\-class z7, x < z. SO we can take n = <z, x, f>. 

The main result we need concerns the restricted upper bound principle 
RUB ; De NV AEE, 

mon Pm in 

where x < z and each Y,,, is of class z. The corresponding rule (from the 
antecedent to infer the consequent) is valid for the normal rules by lemma 3. 
But RUB is not itself provable in our system, for it is non-constructively 
false. Let Y,,., Mean that n is the mth element in some non-constructive 
enumeration of T,, x = 0 and z = 1. RUB” denotes the restriction of RUB 
toz < v. Let B® = V"%,, where each 6, €e RUB. 8™ denotes the disjunc- 


k 

tion of all the — %, with §, e¢ RUB’. 

V. If }8+Alp], where A is of A-class u and each 8, € RUB‘, then 

FBO +A. 

We assume that this is true for all u, p and v where (i) v < v’ and p <p’, 
or (ii) Vv < v’; and we prove it for v = v’ and p = p’. The induction hypoth- 
esis (i) obviously takes care of all cases but when the last step in the 
derivation is 6+ —%,+4= 8+4, by V, where %, is RUB’—RUB". 
Le. B, = A* V7Men 7 V7A*V mn» Where the premise has a normal 


m n 


proof of rank < q < p. Replacing — %, by its conjuncts, we get 


(1) io Ae A*V nal a], 
(2) -B+A+V*A Ural] (all reT,). 


By (i) and (1) + B@+A4+A*V 7g, since 4+ A*V 7%, is of A -class z. 
So by lemma 3 + 8%+4+A*V"%,,, for some reT,. By (i) and (2) 
+B@+A+V*AT —Y,,,: and so by a cut, # B+. Since B, is RUB? 
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for the — %,, which are disjuncts of 6 andz < v, + 8 +4 follows by (ii). 


Since RUB! is empty we have the following 
CoroLiary. If A is of (-class 1, then RUB A implies + A. 


We will use RUB to study the theory of inductively defined sets. First 
we need 


Lemma 4, Let 8(P) be of class z and not contain — P. Let X,(b) be of class 
z for each kéT,, and assume that for each such k and each mek, 
+ O(n) > A,(n) for all neT,. Then RUBt B(V 72,) > V 7B). 

k k 


Proor. The proof is by induction on 8(P). If B(P) = Pa, there is nothing 
to prove. Assume that it holds for %,,(P)’ for each me T, wheie x < z. 
Then it clearly holds for V*%,,(P). By RUB 


AB al V 72) > A*V 78 ,(2L,) + V7A*S 2b) 
m k m ik ko om 


since V PBm(s) = Ba(Ap). 


In what follows (PL) will denote the system of propositional logic of . 
class z; i.e. which involves only formulae which contain no quantifiers 
and are of class z. It is obvious that each formula and derivation of (PL’) 
can be coded by a tree in T,. 


5. Theory of inductively defined sets. Let 2(P, 5) be of class z and not 
contain — P. Set 


Pb) = VV UP", b) 
mek 


for each ke T, so that P* is of class z. Set 


Pb) = \V?P"(b). 


k 


Choose m with kem. Then + U(F*, b) + P™(b), and so + W(P*, b) > Ab), 
for all ke T,. So by lemma 4 


(a). RUB | AUP, n) + An). 


Also, using the fact that — P does not occur in 2(P, b), 


(b). tA (218, n) + B(n)) + A (Pa) + Bn). 
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To each propositional *-formula 8 we assign an interpretation 8 by 
replacing each part Py,(F) by the disjunction ee 28) as defined 


above for U(P, b) = Y(P, F, 6) of rank < z. Note that *At is Ge class @ 
(i.e. each of its members is of finite class), and *¥, is of class w? 


THEOREM 4. If © is 2°, then every derivation of & in (Ind) (and so in (Z{-CA)) 
can be transformed into a derivation of & in (PL). 
For | © in (Ind) means + D > © in (*At-CA) where ® is a finite con- 
———, 


junction of universal quantifications of instances of Ind. So + D(*At) > € 
in (PL®). But D(*At) is a conjunction of instances of (a) and (b). 


TuHeorEM 5. If © is £°, then every derivation of © in (A}-CR) transforms into 
a derivation of © in (PL®”). + © in (43-CR) implies | D > © in (*A}-CR), 
where D is as above. 

By lemma 1+ D > Cin some (*F,),k < w.Sot D(*F;,,) > Cin (PL®”). 


THEOREM 6. If © is 2°, then every derivation of © in (2}-ADC) transforms 
into a derivation of & in (PL*?). 


t © in (X3-ADC) implies + A Da A ©; > in (*At-CA), where 
i=1 i=1 
D, and ©, are universal quantifications of instances of *Z}-ADC and Ind’, 


resp. So +7>~D,+...+ 3,+ A&E, 7C[u] m (FAt), for some 


i=1 
<2). 
It is easy to see that each instance of Ind’ can be put in the form 
pr elgg Y(t. X41, 05; Y,) where Wis *2°.. So 
each ©, is of this form, and hence /\ G; — © is the negation of this form. 


i=1 
~~ 


Applying lemma 2, + A G? > ©. But G? is a conjunction of instances of (a) 
and (b) and is of ds Ou. Sole Oan(Pi-s |: 


6. Remark. The statements of theorems 4-6 suffer from the defect that, 
if a set of Y° formulae is derivable in any PL(z), then it has a normal deriva- 
tion in PL(1). What is really wanted for these theorems is a statement of 
exactly what constructive methods are used in transforming the formal 
derivation into a normal derivation in PL(1). The formulae and derivations 
in PL(z) can be coded in a natural way by elements of T,; and all of our 
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arguments can be recast as definitions of operations on T, by recursion 
and free variable proofs by induction on T,. I will not give the details here; 
but the constructive principles involved are discussed explicitly in Tait [7]. 
One consequence of giving such a complete description of the constructive 
principles involved is this: All of the operations on trees which we use are 
introduced by a finite set of definitional schemata. The constant terms built 
up using these schemata form a decidable model TZ for the T, (x < z) 
and the definitional schemata. Indeed this model arises by restricting 
<x, y, f> in T, to the case in which f is obtained by these schemata. Thus 
the partial orderings given by the elements of T7, and in particular the 
formulae and derivations of PL(z)*, are recursive well-founded orderings. 
An investigation of the ranks of these orderings should lead us to bounds 
on the provable ordinals of the formal systems considered in theorems 4-6. 
For example, in view of Takeuti’s result, we should expect that the ranks of 
elements of we are bounded by ordinal diagrams of order w+1. 
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PRINCIPLES OF PROOF AND ORDINALS IMPLICIT IN GIVEN 
CONCEPTS 


G. KREISEL* 


This lecture concerns the principal’ defect of existing proof theory stressed 
in the survey [18], particularly p. 323 or p. 360, the lack of a clear and 
convincing analysis of the choice of methods of proof; more specifically, 
of the choice of formal systems to be studied and of the metamathematical 
methods to be used in this study. Many metamathematical problems such 
as the consistency problem depend on such a choice. The ultimate aim is 
nothing else but the discovery of objective criteria for such a choice.* As 
indicated in [18], what one is after is a (phenomenological) description of 
certain kinds Y of mathematical reasoning; the objective question is then 
simply this: whether the proofs represented or described by derivations of 
a given formal system ¥ are in FY (soundness of ¥ ); whether all proofs in 
Ff are represented in ¥ (completeness with respect to F or the weaker 
condition of completeness with respect to provability in #). 
The particular kinds of reasoning considered in the present lecture can be 

roughly described as follows: 

What principles of proof do we recognize as valid once we have 

understood (or, as one sometimes says, ‘accepted’) certain given 

concepts? 
The process of recognizing the validity of such principles (including 
principles for defining new concepts, that is, formally, of extending a given 
language) is here conceived as a process of reflection; reflecting on the given 
concepts, reflecting on this process of reflection, and so forth. It is not 
assumed that every significant area of mathematics is properly analyzed 
in this manner; not even all those areas which may be described as: what is 
implicit in given concepts. For instance, if the basic concepts involve a 
very high degree of self reflection. 

Granted that we have to do with an area Y which lends itself to the kind 
* Preparation of this paper was partially supported by Grant DA-ARO-31-124-G985 
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of analysis indicated, it is evident that ordinals play a basic role. They index 
the stages in the reflection process. A moment’s thought (for people not 
totally ignorant of classical set theory) shows that some basic distinctions 
in the meaning of the notion of ordinal are needed corresponding to the 
concepts considered as given; distinctions parallel to the familiar ‘classical’ 
material summarized in technical note I. 

The two principal sets of concepts considered are: 

1. The concepts of w-sequence and -iteration. 

2. The concepts of set of natural numbers and numerical quantification. 
Though related to earlier work on autonomous progressions ((16], pp. 
168-173 for ‘finitist? mathematics, [4] for ‘predicative’ mathematics) the 
present paper involves an essential change in that it separates two distinct 
questions which were previously treated simultaneously. 

One question is, so to speak, the theoretical question: what is implicit 
in given concepts? perhaps to be compared with a question of the sort: 
what (physical or mathematical objects) can be built up by using given 
basic material as fully as possible? The other, so to speak, empirical question 
comes about as follows: There are objects which we have come to recognize 
under a certain name or by external properties of which we are more or less 
explicitly aware (say solids in physics, algebraic proof in mathematics etc.). 
Do these objects coincide with those generated by ‘theoretical’ or, perhaps 
better, more explicitly analyzed processes? 

Evidently, the kind of separation here considered would be out of place 
at an early stage of the work except when one happens to deal with childishly 
simple situations. For instance, if the concepts needed to answer the 
theoretical question are already available or if some very familiar properties 
of the empirical material determine all its laws (subject only to very general 
assumptions; cf. the discovery of the science of geometry). In general, one 
has to be prepared for a detailed theoretical development before one can 
even begin to compare theory and experience precisely. Before that the 
choice of theoretical notions is, more or less, suggested by the empirical 
material, and modifications in the empirical notions are suggested by the 
shape of the theory. 

But it seems to me that, in our subject, it is not unreasonable at the 
present time to attempt a comparison. In technical note II, I shall go over 
some of the evidence relating Hilbert’s notion of finitist proof not with the 
area 1 described above, but with the principles formulated in primitive 
recursive arithmetic. 
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1. Introduction (not needed for understanding the proposals and results 
below which ultimately must be judged on their merits). Our first aim is 
to make the study of informal notions of proof plausible. Put differently, 
since the raison d’étre of anything like existing proof theory seems to rest 
on such notions, the aim is nothing else but to make a case for proof theory; 
a subject which in point of fact is not popular. At least in my opinion 
no useful purpose is served by attributing this (sociological) fact to such 
fictions as a prejudiced logical ‘establishment’ or an ‘apathetic’ body of 
logical workers. At present a good number of proof theoretic methods are 
available (and thus a long list of both solved and open formal problems); 
as anybody familiar with the subject knows, some of these methods are 
ingenious. What is lacking is a good answer to the objection: why pursue 
proof theory now when we know that its original aim (Hilbert’s program) 
was based on false assumptions about the nature of mathematical reasoning? 

The first point to note is that the question of characterizing notions of 
proof has presented itself repeatedly in the history of mathematics (and 
is thus, literally, a natural question). The current distinction between, 
say, projective and affine theorems, in the sense of: valid statements about 
all models of the axioms of projective and affine geometry, corresponds to 
an earlier distinction between projective and metric methods of proof. It 
was a discovery that this distinction can be analyzed in terms of generality 
of the results; more technically, in terms of a restriction on axioms used 
(rather than, say, in terms of rules of inference); of course there is no reason 
to assume that every legitimate distinction between different methods of 
proof admits this kind of analysis.° 

In number theory, serious workers spoke of a distinction between 
elementary and analytic proof, and, more specifically, between proofs using 
real or complex analysis; see, e.g., Ingham’s monograph [11]. Their formula- 
tion was illiterate (if one remembers that complex numbers are, traditionally, 
regarded as pairs of real numbers): but this in no way means that it had no 
substance. 

In [13], p. 248 I proposed an analysis of this distinction roughly as 
follows. We consider quantifier-free theorems T and call a (quantifier-free) 
proof of T elementary if it contains only the function symbols occurring 
in T, their definitions (including thus the auxiliary functions used in these 
definitions, e.g. multiplication in the definition of factorials), and some 
basic minimal apparatus. The restriction to quantifier-free, that is logic- 
free, reasoning is clear: the set-theoretic operations involved in the meaning 
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of quantified propositions are, roughly, what’s typical of analytic proofs! 
Shepherdson [26] discovered several familiar theorems which, demonstrably, 
do not have elementary proofs in the sense above. 

The defect of my formulation was, of course, not the mild ambiguity 
in what counts as ‘basic minimal apparatus’, but simply that it was too 
formal. Why restrict oneself only to the function symbols actually occurring 
in 7, and not allow all that is implicit in the operations denoted by these 
symbols?* Though (or: because) subtler, the kind of notions of proof 
considered in the present lecture may permit a more significant formulation 
of the old distinction between elementary and analytic proof in number 
theory. 

At the present time the legitimacy of this old distinction is not in doubt. 
Also Gédel’s incompleteness theorem, with an absolute minimum of 
assumptions on the details of the distinction, shows the existence of theorems 
stated in elementary terms which do not have an elementary proof.* But a 
natural modification of the old question is open, namely: 

What can we say about the kind, say syntactic structure, of elemen- 
tary theorems which may need analytic proofs? More generally, 
we should here replace ‘analytic’ (i.e., real variable proofs) by ’set 
theoretic’ (i.e., proofs involving axioms of infinity). 
Let it be noted that people currently working in number theory do not seem 
to analyze their proofs or conjectures in ‘logical’ terms such as those above. 
But they are very sensitive to the kind of considerations that may be essential 
to a particular problem.® 

One last word of introduction. Perhaps the single most striking aspect 
of distinctions between notions of proof is a certain finesse (which, according 
to one’s intellectual temperament, may be satisfying or frustrating). 
Logically, as is well known, the simple minded idea is that we want our 
proofs to be ‘reliable’; (cf., e.g., [18], p. 361(ii)) or quite explicitly (as 
Hilbert once said [10], p. 157) that Zermelo’s axiom of choice be ... 
reliable in the same sense ... as 2+2 = 4. But though the degree of 
reliability may be the same (for instance: certainty) the kind is different 
in the two cases; c.f. [18], p. 361(ii). Mathematically, the simple minded 
idea is that since (quite properly) our primary purpose is to find out what 
is true, we don’t want to be bothered with distinctions between different 
kinds of proof. But this overlooks the following point which, at least for 
some, can be practically useful: if we have (mastered!) theoretical reasons, 
for instance abstract interpretations, restricting the kinds of proofs (or kinds 
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of conjectures) that are liable to be successful, we have in fact a better 
chance of discovering the truth. 


2. Ordinals. Quite naively (and also formally in set-theoretic foundations, 
cf. Note I) we distinguish between several more or less related notions 
(quite well illustrated by ‘the’ notion of finite ordinal): 

The process of building up the integers by a successor operation say o, 
with the property that ox is distinct from x and from all y built up before x. 
This process determines an ordering (= built up before) between the 
integers, and hence an abstract order type <. 

A much used property of the order type < is its well-foundedness, usually 
formulated either with variables X over subsets of our ordering or (monadic) 
variables f over functions from (canonically given) integers into our 
ordering, where n* is the canonical successor of n. 


VX{¥x[(Vy < x)(ye X) > (xe X)] > Vx(xeX)]’ or Vf3n—(fn™ <fn). 
Clearly, the meaning of these principles involves the meanings of 


subset X and the logical operations V, > 
@-sequence, and the quantifier combination Vfin 


Trivially, so does the relation between the two principles, and the relation 
between the abstract order type and the building up process. 

Illustration. There are primitive recursive w-orderings (in the usual sense) 
for which well foundedness can be proved in the most elementary way, 
but we do not know the first element.® 

More important, for a classical interpretation of the quantifiers: VX 
and Vf, the formulations of well-foundedness assume a well defined range 
even for the variables Y and f.? 

In contrast, in terms of the notion of natural number and specific w- 
sequences we can express quite straightforwardly the stronger’® assertion 
that some given ordering < has been built up from an (explicitly given) 
Xo, by means of iterating an explicit o finitely often. (In fact, for variable 
x over our ordering 


Gx #X,, OM =x, CX =OYOx=y 


[n th 


where o'"! is the n iterate of o, and x is a numerical valued function 
defined on the ordering.) Put in more ‘mathematical’ terms, if we want to 
express the idea of an ordering built up by means of a process, we have 
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to think of an algebraic structure, an ordering together with certain functions 
on it (and usually the latter are primary in that they alone may be enough 
to determine the ordering). 

Now the notions used above in formulating the process of building up 
q-orderings also allow the analysis of larger orderings. Our problem is 
to make the analysis sufficiently precise to establish /imits to this process. 

Some of the principal results take the form: 

If an algebraic structure 9{ can be obtained by the process in question 
so can F(2l), where F is some (familiar) operation. Needless to say, ‘corre- 
sponding’ results for the different notions of ordinal at the beginning of 
this section may be incomparable. Specifically, compare orderings built up 
by some process and orderings seen to be well founded (in both cases, by 
methods implicit in given concepts). Even if a larger class of orderings may 
satisfy the second condition, the former may be closed under ‘stronger’ 
operations, simply because we know more about an ordering when it has 
been built up in a prescribed way.'* 

In terms of the distinctions above, it will be possible to deal quite quickly 
with the ordinals implicit in two important groups of concepts related to a 
constructive and a non-constructive use of the notion of natural number 
(or, more precisely, set of natural numbers). 


3. Ordinals built up by @-iteration starting with the notion of w-sequence 
(and two distinct objects of course). The essential formal problem is to 
express properly the idea that some ordering has been built up by (iterating 
the process of) w-iteration, starting with the natural ordering of the integers 
(the latter is determined uniquely up to isomorphism, as explained in the 
last section, an isomorphism built up from x9, 6, 1; Xo, o’, x’ if the orderings 
< and <’ satisfy the conditions given). Jf an ordering has been seen to be 
built up in this way, proof by induction on it or definition by recursion are 
then evident consequences.‘* 

(a). Multiplication by w: the basic step. Given an ordering (say of natural 
numbers) and an element €,, it is perfectly clear what quantifier-free 
information expresses that the ordering is built up from its initial segment 
(preceding €,) by adding w copies of itself: 

a sequence €,, ¢,,..., and a numerical valued mapping v defined on the 
ordering such that ¢,, < y < €,,),,; mappings o(n, x), 2(n, x) such that, 
for each n = 1,2,... @ maps the segment €, onto [€,, €,4,) preserving 
order (where z is used to express ‘onto’, i.e. 
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n(n, x) takes some conventional value if x ¢ [€,, 6,41), 


En, cae) — x for x preceding ¢;). 


Note that the definition is meaningful for any ordering (indeed any binary 
relation). It preserves the basic property of uniqueness up to isomorphism 
as follows: 

Given two orderings <, <’, sequences ¢, v, o, n and €’, v’, o’, n’ satisfying 
the conditions above, and order preserving mappings p, pw’ between the 
SEAUICHicm Va And GC ye Crmiben fi, 7 can be uniquely 
extended to mappings between the whole orderings.’* 

(b). Iteration of the basic step. The general idea is clear enough. We start 
with the ordinal w; as always, ‘ordinal’ in the sense of an algebraic structure 
together with assertions expressing that the ordering has been built up in 
the way intended (here: w-iteration of a successor operation). Suppose now 
a is an ordinal built up by -iteration; then we get (the ordinal, in the sense 
above) w”* by iterating the basic step « times; in particular, since « was 
seen to have been built up as required, the usual definition of w* by recursion 
on a expresses that the w-sequences (functions) are uniquely defined. 
Putting @, = @, @,,, = w°", we thus get to w, for each n = 1,2,....'* 
I shall set out elsewhere in full (and, [ hope, agreeable) detail the whole body 
of operations and assertions about them that constitute the ordinal o, in 
our sense. 

Now it is obvious that we do not get to €, by the particular process of 
reflection just described; we’d have to recognize % for which ¢ = o” 
as an ordinal before recognizing &) as an ordinal; but a» = &p itself. So 
the open question is whether we can get to & at all by means of principles 
implicit in @-sequence and w-iteration. A really conclusive (negative) 
answer will be given in section 6 when enough machinery is developed. But 
(as so often) it is good to analyze what we can say about the matter at the 
present stage. 

Note first that there is no doubt which ordering, if any, for € is to be 
considered; nor of the functions, say x @ y, exp, on the ordering to 
represent addition and exponentiation to the base @; that is, if |x| is the 
order type of the segment preceding x, 


Ix]. 
’ 


|x ® yl = |x|+lyl, lexpy x] = @ 


in particular, we have a sequence w,, for which |w,| = @,. Any ‘conceivable’ 
set of recursion equations satisfied by our ordinal functions can be formally 
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derived. Similarly our ordering can be seen to be the /imit of the w, since 
we easily get a numerical valued function 1 such that x precedes w,,. What 
is missing is the recognition, by means of principles implicit in the concepts 
considered, that our ordering is built up in the way intended.‘* 

The main issue can be put as follows: Let Q(a) mean: |a| is an ordering 
that can be built up by iterating the process of w-iteration. We certainly 
do not dispute the truth of 


(*) Val Q(w,) > Q(w,+1)]- 


In fact, we admit VnQ(w,) (granted that Q is meaningful). Can we infer the 
truth of Q(lim w,)? What stands in the way? 

(i). Of course we should have Q(lim w,) if (*) were proved by means of 
the principles considered. 

But is there any stage, in our reflection on the notion of q@-iteration, 
where Q is even defined for all w, (on our understanding of the concepts 
used)? Let alone where (*) is so proved? It is precisely this matter which 
requires the close analysis below of the representation of proofs implicit 
in our concepts.'® 

(ii). Clearly, we should also have the truth of Q(lim w,), granted the 
truth of VnQ(w,), if the length of the sequence w, were significant. In other 
words if the truth of Q(lim x,), for « < B, followed from the fact that f 
is a small ordinal. 

Consider, in place of Q, the set theoretic property Q,,: to be an infinite 
cardinal < %,. The sequence X_, 1 — 0) 1,223 1s short. and, jon cache. 
Q,(&,,). One would be ill advised to conclude 2,,(%,,). 

I do not know if this comparison is a mere debating point. Be that as 
it may, we certainly have no evidence for the proposed inference (it). 

Digression. If one is interested in the kind of reason that can be given for 
(ii), we can, of course, look for the application we wish to make of our 
‘theoretical’ notion of proof (in the sense of p. 491); I think we get an 
obviously negative answer to (ii) if our interest is that described in [16], 
pp. 169-173, namely to obtain limits to what we can learn to visualize, 
given the ability to visualize a sequence of w-copies of a configuration which 
we can visualize. But there is also a more theoretical way of looking at the 
matter. . 

Even granted that é, is not implicit in the concept of w-iteration, we now 
ask ourselves: what is implicit in the given concepts together with the concept 
of reflection on these concepts? (reflection of the kind analyzed above).'7 
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Certainly this is a natural question for anybody interested in getting at 
things from ‘below’. Here it is to be remarked that the mathematical 
apparatus of [3] gets its most natural interpretation in terms of the question 
above. Feferman starts with the function 1+.«, call it Vie and enumerates 
in Ap the critical points of 4? for y < f, that is 


Apo = wf +a. 


Defining Aja = 420, we have Aja = w*, Ala = e, etc. Feferman uses only 
the stretch preceding the first critical number I, of 42, but, for the 
mathematical process described, this is by no means a natural stopping place; 
and one of his contributions to the present conference gives a far reaching 
extension (by use of higher types) which not only doesn’t go beyond the 
idea of the process mentioned, but doesn’t exhaust it. 

We shall not pursue this idea here, but rather analyze a notion of proof 
for which I is significant. This asymmetry between (i) the significance of 
the methods of proof (or: the natural context of the methods) and (ii) the 
results established is common enough; as somebody said one need only 
think of the geometrical significance of theorems in n-dimensional geometry 
for n > 4 say. In fact, it goes without saying (because it has been said often 
enough) that the discovery of elegant mathematical methods needed to deal 
with a subject matter, may use ideas not implicit in the latter. From the 
present point of view, Feferman’s use of the process mentioned to deal with 
the ‘small’ segment < Ig, is a successful application of this truism within 
metamathematics itself; thereby avoiding one of the horrors of proof 
theory described in note 1 on p. 507. 


4. Ordinals implicit in the classical logical operations on sets of natural 
numbers: boolean operations, cartesian products, permutations and identifi- 
cation, projection (= numerical quantification and, in particular, the set 
of all natural numbers'’). The notion of proof here considered should be 
compared to what is called ‘predicative’ or ‘predicative relative to the notion 
of natural numbers’ in the recent literature; mainly of Feferman and 
Schiitte following proposals in [14]. The principal points that seem to me 
to have been neglected, are these. i 

(i). The notion of set (of natural numbers) or, formally, the use of 
unramified free variables for sets, far from being a defect or a mere technical 
device, is an integral part of the notion considered. To use a hackneyed 
phrase: it’s not the notion of set, but what you do with it that’s problematic. 


498 G. KREISEL F XXXI 


An immediate corollary is that the notion of ordinal in the sense of a well 
founded relation can be expressed directly (cf. note 9, p. 509). Similarly, we 
have ordinal functions in the sense of: preserving well foundedness. 

Examples: To justify: «— 2* we use, for any given set X, a suitable X,, 
defined essentially just by projection, such that 


[(Induction on « applied to X,) > (Induction on 2” applied to X’)]. 


Plainly, since we assume numerical quantification to be understood, 
we can form X,, defined from X by w-iterations of our basic operations and 
justify: «a — e,. 

(ii). Though of course a principal problem is the formulation of definition 
principles for our notions, one mustn’t forget that something, more specifi- 
cally, some set of natural numbers is being defined:'? 

The critical open problem now is this. Since we are considering ordinals 
in sense (i), iteration processes have to be justified, not directly, but via 
well foundedness; in particular, in connection with the ramified hierarchy, 
that is the transfinite iteration of our basic operations on sets, induction 
has to be applied to the property P: 

the (formal) definitions at a level of the hierarchy considered are 
understood if our basic concepts are understood.?° 
Though P may be thought of as a property of numbers (or, if preferred, 
Gédel numbers) there is no stage in our reflection on the concepts where 
we can assume P to be generally understood; cf. 2 ‘n the preceding section. 
The obvious solution is this: 

Recall that having proved at stage « that f is well-founded in sense (i) 
(formulated with free set variables, cf. note 9, p. 509), we use infinite trees of 
ordinal £ as formal definitions (at stage B of the hierarchy). At each node 
is a formula, either infinite or its finite code (cf. note 20, p. 510). The build-up 
is ‘locally correct’, that is the formula at a node AN is built up from the 
formulae at the immediate predecessors of N by means of the classical logical 
operations listed in the section heading. There are two questions: 

(a). Is the final formula, say F, qua (infinite) syntactic object, determined? 

(b). Is F a good definition? 

As to (a), well-foundedness in sense (i) is (more than) enough; we need it 
only for a suitable specific arithmetic set (coding the sequence of sub- 
formulae of F) in place of the free variable X. As to (b) the truth of well- 
foundedness is certainly not involved (cf. last sentence of note 9, p. 509). Since 
we do not have an explicit definition for P to substitute for X, it seems reason- 
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able to suppose that the formal derivation of the well-foundedness of p is 
needed for (b); specifically, we expect to use the derivation as a (naturally, 
infinite) schema which need be applied only to instances of P whose meaning 
is determined at stage «.”’ In other words the (infinite) formal derivation 
would be used analogously to a piece of quantifier free algebra for strictly 
finitist applications, that is as a schema for purely numerical substitutions 
(cf. [13], p. 242, 4). 


5. Autonomeus progressions. Having looked at the ordinals, we now turn 
to proofs, more precisely to other concepts and principles of deductions 
implicit in given concepts. A first step is to describe the latter by a 
(recursive or non recursive) hierarchy of systems made up of finite formulae 
and finite derivations. It is only a first step; or, if preferred, we have a finite 
description of infinite (extensional) configurations which represent more 
faithfully the objects under study, namely intuitive proofs.*? I shall confine 
myself to a few examples of the kinds of results one has for the notions 
discussed in (a) section 3 and (b) section 4. 

(a). w-iteration and w-sequences. Since our notions are ‘logic free’, the 
language used is quantifier-free; the essentials are therefore definitions 
principles. The most obvious use to be made of the ordinals is for definition 
by recursion. Let the system P, express principles of definition by recursion 
on we '%,23 

(i). Clearly, if P, is justified, we have seen that each formal theorem of 
P, is valid (on our principles). Formally, we should be able to prove the 
reflection principle for P,, which requires a valuation function for P,. This 
is in fact available in Pz for some 8 > «. (The details are familiar.) 

(ii). We have also a converse to (i), namely if we build up a hierarchy 
P,, where P,,, is obtained by adding the canonical valuation function for 
si Rca ene 

a<é a<eo 
see e.g. [16], p. 172. The converse is plausible in the sense that the ‘most’ 
one can expect of P; (or of any formal system!) is that all its formal theorems 
are valid.?* 

(iii). We may add free variables f for w-sequences,’*® and corresponding 
definition principles for operations on them (functionals). The property of 
being well founded can now be expressed by 


mo + Wich) 
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for some functional constant t. We then get the results stated in note 16, 
p. 510. For reference, call this progression P,’. 

(iv). In, so to speak, an opposite direction, we may use the same language 
for all systems considered, say £, (‘E’ for existential), using only rudi- 
mentary functions and a constructive existential quantifier as in [14] p. 
2937°; E,,, is obtained from E, by adding (in place of the valuation func- 
tion) the reflection principle for E, by use of the satisfaction relation for 
E,. This is possible since, as observed by Myhill [20], the satisfaction 
relation for a purely existential language can be defined in the language 
itself. 


OP and eRe? 

axe a<£0 
are then equivalent in the obvious sense (in particular for any theorem 
4xA(n, x) of the latter there is a function term f, such that A(n, fn) is 
proved in some P,). 

(v). In terms of (iv) we then get an easy comparison with (classical or 
intuitionistic) first order arithmetic. For each rudimentary A let A’ be its 
canonical translation (unique up to provable isomorphism) in first order 
arithmetic Z,. Then 


dxA(n, x)e\) E, if and only if 4xA’(n, x) €Z, 
a<e£p 

(b). Classical logical operations on sets of natural numbers. The reader is 
referred to [4] for a survey of corresponding results. Ramified hierarchies 
with and without free set variables correspond to (iii) and (i) respectively. 
Hierarchies based on the Z}-rule of choice together with the reflection 
principle correspond (more or less) to (iv).*’ 

Remark. ‘Autonomous progression’ means: progression using auton- 
omous ordinals, since we first pick out the ordinals (by considerations 
proper to the concepts treated) and then build up the progression. It turns 
out as a theorem that these ordinals are equivalent to those which can be 
proved in the progression to be well founded; cf. (a) (iii) above. But, as 
stressed at length at the end of section 4, we cannot use these proofs directly 
to justify the iteration of the progression.?® 


6. Proofs: infinite representations and their finite descriptions. Recall the 
basic issue at the end of section 3 (in particular, note 15, p. 509): why can’t 
we infer Q(lim w,) from the truth of VnQ(w,)? This issue has a perfect 
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parallel for any of the progressions in the preceding section; using, say, P,’ 
of (a)(iii), we can define the proof predicates P{’ or better P, (for variable 
n) in Pg; and using O to express well foundedness (in the language of P’’) 


we can explicitly give a function x in P¢ and establish, for variable n: 
Pan, O,), 


where O,, is a term which gives the Godel number of a formula expressing 
well foundedness of w,4,.7” Hasn’t this established VnQ(w,) in our pro- 
gression? 

The obvious answer is this. Though we have described the proof predicates 
Py,,, there is no stage (of our reflection process) where we know that they 
are sound for our notion of proof. The most we can say is that the formal 
derivations appear to us ‘good’. More explicitly, let us recall the Jong and 
tortuous process by which we convince ourselves that the formal derivations 
of each P,,, provide proofs implicit in the notions treated; a formal derivation 
in P,,, in no way reflects this process which is, after all, the object under 
study. 

To put this answer into more ‘mathematical’ (quantitative) form we simply 
require more explicit representations of the process in question, and propose 
to use infinite proof figures, cf. note 22, p. 511. Let us write A, for the 
corresponding progression. 

Just how explicit should the representation be? Evidently, if we are to read 
the result ‘straight off’ the representation, we'd never get off the ground. 
More precisely, suppose the representation of the proof justifying P,’, 
that is the definition principle by recursion on w®'*, is so detailed that an 
enumeration of all w®’*-recursive functions is, say, primitive recursive®° 
in the (infinite figure representing the) given proof; then trivially the represen- 
tation is no simpler than w®’*-recursion. But let us remember our basic 
starting point: w-iteration is taken to be understood, to be regarded as a 
single act; it follows that a representation will be explicit enough (for 
somebody capable of such acts) if it symbolizes a sequence of w copies by 
a single element, and such a representation of a proof justiying P,’ can be 
defined by x-recursion.*’ But the whole sequence Ff, (for n = 1,2,...) 
cannot even be defined in any particular 7, , let alone in A). Undefinabil- 
ity results are always more convincing than unprovability! More impor- 
tant, they avoid a natural error (e.g., by Herbrand [7], p. 231 1. 20-23 or 
myself, footnote 2 on p. 242 of [13]) of supposing that the theorems implicit 
in given notions (or, e.g., the finitist theorems) cannot be recursively 
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enumerated; what is true is of course only that no such enumeration can be 
proved to be sound by means of the particular methods considered. 


7. Relation to intuitionistic mathematics (as presented e.g., in section 5 
of my other lecture). The most striking difference between the general 
intuitionistic notion of constructive proof and those considered here 
(that is, proofs implicit in the two sets of concepts of sections 3 and 4) is 
clearly this. 

In intuitionistic mathematics a proposition is understood if we have 
grasped the possibilities of proving it (at least, by irredundant or ‘normalized’ 
methods). For the notions of proof here considered, not even purely 
universal statements formulated at the lowest level, say in Py, are understood 
in this sense, because new theorems of this form appear all the way out to 
Fees 

In view of this observation the comparison between constructive and set 
theoretic methods in (a) on p. 212 of [19] should be modified as follows. 
We still make mechanical computations correspond to the hereditarily finite 
sets. But notions of proof implicit in such concepts as considered in the 
present paper correspond to ramified versions of the cumulative hierarchy 
up to ordinals implicit in the definition principles used for the ramification.?? 
Intuitionistic proofs correspond then to the cumulative hierarchy itself. 
Specifically, just as there is no set of ‘all’ sets, so we have no grasp of ‘all’ 
proofs or ‘all’ constructions; but just as for any given set there is a set of all 
its subsets, so for any given construction we have a grasp of proofs about 
it (for more detail, see the discussion of grasped domains in my other 
lecture).** 

Certainly there is something depressing about the comparison. After all, 
the current search for axioms of infinity is very naturally thought of as: 
looking for the ordinals implicit in the power set operation; and why 
should the problem mentioned in the digression at the end of section 3 be 
any easier? Or, again, in 1908 Zermelo [33] gave a discussion of the notion 
of set and some important properties, his axioms; but it was only 22 years 
later, in [34], that this great logician was really able to say clearly what 
notion he was talking about (incidentally, with very little help from the 
many pages of ‘technical’ papers that had appeared in the intervening years). 
Is there anything to encourage us? 

Well, since the question assumes the comparison, let’s also assume it 
in the answer! Then, trivially, we start where the pioneers in set theory 
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left off. A quite different point is that here we want to know about construc- 
tions which in turn refer only to our own constructions and reflections on 
this process. I still don’t see (cf. [17], p. 267, 1, 18-19) any guarantee for 
easy (self) knowledge; but it seems reasonable to take a chance. 


TECHNICAL NOTES 


I. Background on ordinals. The purpose of this note is to summarize the 
most important familiar distinctions both in (a) set theoretic and (b) 
intuitionistic mathematics. 

(a). Set theory. The notion of transfinite iteration is used in the analysis 
of the cumulative hierarchy itself (which is the principal interpretation of 
current formal set theories [34]). It is therefore, perhaps, not too surprising 
that we can ‘recapture’ quite faithfully this notion in terms of the notions 
of set theory itself. 

Case 1: ordinals built up by the process of forming successors a (of any 
given ordinal) and the supremum ga’ of a set of ordinals. Any building up 
process induces a (partial) ordering, say <, where x < y if x occurs at 
a stage in the process of building up y. To speak of a ‘building up’, clearly 
(i) ox # y for all y < x; and to speak of the supremum o’x (whose con- 
struction involves only the elements of the set x of ordinals), (ii) if y < o’x, 
for some (ordinal) ue x we must have y < w. It is natural, to satisfy (i), 
to form ox by operating on the set {y: y < x} (for the relation < to be 
constructed!); but it is not altogether obvious that there is a uniform way 
of defining o which ensures (i); von Neumann discovered 


ax =X Ug ae 


it is, in fact, the successor operation forced upon one if < is to be member- 
ship. To satisfy (ii), one puts 
gee | yo 


if one wants the supremum of a set with a largest element z to be z, and 


ox =| |i ayeye x) 
if one wants it to be az. (If x has no largest element the two definitions are 
equivalent.) Note that (iii) if x and x’ are equivalent in the sense that 


Vudvluex > (vex'au <v)|, VWodulvex’ > (uexan <u) 


then o’x and o’x’ are actually identical. 
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A critical point here is that ox is of higher type than x. There is certainly 
no apparent way of getting ‘uniform’ (e.g., set theoretically definable) 
c and o’, and <, satisfying (i)-(iii) which stay at some restricted type; 
and it is consistent (with the usual set theoretic principles) to assume that 
there is none. 

Case 2: ordinals treated as order types satisfying some condition of well 
foundedness such as 

(i). The least element principle. 
(ii). The principle of proof by transfinite induction, that is for each 
formula Fx with free variable x (and not containing y) 


Wx[(Vy < x)Fy > Fx] > VxFx 7°. 


(iii). Finiteness of descending sequences (from w into the field of <). 
Clearly, the ordering built up in case 1, satisfies (i)-(iii). The converse 
involves a quite independent principle.*® In short, the remark on p. 493 
on different notions of set theoretic ordinal was justified. 

(b). Intuitionistic mathematics. Though the basic distinctions which arise 
here are remarkably similar to the set theoretic case, the proper emphasis is 
different. (I use [8] as a convenient reference.) 

Case 1: ordinals built up by the process of forming successors and suprema 
of w-sequences.?” These were originally introduced by Brouwer by means 
of higher type operations, indeed by use of species, but as observed in [8], 
they can be reduced in an obvious sense to constructive orderings of the 
natural numbers.?° Each of these orderings has a constructive characteristic 
function, a constructive successor and constructive so called fundamental 
sequences. 

Case Z: ordinals treated as order types; it will also be important to dis- 
tinguish between decidable order types (with a decidable characteristic 
function and decidable field) and general order types (determined by a 
species, that is a relation which is in general undecidable, but which can be 
proved to be a partial ordering?’). 

Here the distinctions corresponding to (i)-(iii) in (a) are more funda- 
mental. Thus (i) is useless, and (ii) is, for decidable <, stronger than (iii) 
if only constructive descending sequences are meant since formulae F define, 
in general, species which simply do not have a constructive characteristic 
function. Brouwer introduced 

(iii’). all descending freely chosen sequences are finite. 

Brouwer’s socalled bar theorem asserts, for decidable*® < satisfying (iii’), 
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that < can be mapped by a constructive function in to an ordinal built up 
as in case I, preserving order, and hence satisfies (ii). It is evident that (iii’) 
alone does not ensure such a mapping onto an ordinal of Case 1, because 
the successor need not be constructive. 


Il. Primitive recursive arithmetic: PRA. For a description see, e.g., Hilbert- 
Bernays, vol. 1. The purpose of this note is to consider whether PRA 
corresponds to a philosophically* significant notion of proof. Its immediate 
appeal (established by the empirical fact that people take to it) shows of 
course that it corresponds to the immediately evident part of such a notion 
and not, for instance, to a// that is implicit in some given concepts. 

1. In terms of the present paper, a plausible analysis is this. PRA corre- 
sponds to the principles of proof implicit in w-iteration and specific basic 
functions (that is, in contrast to section 3, the notion of -sequence is not 
assumed to be understood; only some concatenation operations). Since, 
as is well known, w® is the limit of the ‘provable ordinals’ of PRA the 
critical point is to discover the ordinals implicit in the concepts above. Let 
ta be the limit of the ordinals built up at stage a of our reflection process. 
Since we apply w-iteration only to specific configurations, not to any 
configuration, we have, for * denoting the successor (as in § 2), 


ta* = ta+m@ with tl =o. 


Assuming the formation of unions (which needs further analysis) we have 
ta = w-a (a copies of w). Clearly w® is inaccessible, since, for a < w®, 
alsom:a < @®. 

2. The only serious*? attempt in the literature to find philosophical 
significance for PRA seems to be Tait’s discussion in [31]. Its central point 
seems to be that the evidence of each proof considered has, in some essential 
way, a strictly finite character. As it stands the analysis in [31] 1s uncon- 
vincing*? since the understanding of any one rule goes beyond this. And 
using a formalism containing only constants for rules, that is, not showing 
variable arguments, does not alter this fact. 

3. Finitist proof (finiter Beweis), as described in the introduction to 
volume 1 of Hilbert-Bernays. (As noted in several places, Hilbert’s own 
publications are not consistent, in fact, his aims in metamathematics were 
not.**) This description seems to indicate a fairly well defined area of 
mathematics, which uses w-iteration only of the particular construction of 
the number series by concatenation operations, and so, by (1), PRA may 
indeed correspond to the notion intended there. 


NOTES 


1. More technical defects which have spoilt proof theory for a long time, are due to the 
following ‘dilemma’. Formal systems which codify actual reasoning well, such as (subsys- 
tems of) classical analysis or set theory do not lend themselves to proof theoretic analysis 
without ad hoc tricks. Those systems which do, are often formal theories of hierarchies such 
as ramified analysis in [24]; usually hopelessly removed from actual practice. Evidently 
the rational step (cf., e.g., [18], p. 345, 1.17-19) is to use an auxiliary model theoretic 
reduction of ‘natural’ to ‘artificial’ systems which preserves a large class of theorems; 
beautiful applications of this procedure are in papers by Feferman [5] and Friedman [6]. 
Surely, one reason for the failure of using this method is sheer ignorance: either of the 
model theory involved or of the fact (cf. note II of [18] for details) that these reductions 
generally provide finitist proofs of conservative extension results. But perhaps a deeper, 
if unconscious, obstacle is the philosophical view that ‘nothing is gained’ by the use of 
transcendental (model theoretic) methods even as far as intelligibility is concerned. By 
accepting instead of testing this view its adherents make sure that indeed nothing is gained 
from their (non-existing) knowledge of transcendental notions! 


2. The reader familiar with mathematical practice will find there related problems in 
making objective the choice between alternative notions and definitions, for instance the 
separation between a structure-mére and obviously minor variants. Equally important, 
the crude chase after ‘bigger ordinals’ has its parallel in the vulgar idea ‘continuous func- 
tions are better than polynomials (say in finite fields!) because there are more of them’. 
Somebody, [ think rightly, said about ordinals in proof theory: It’s not how large you 
make them, it’s how you make them large. Having recognized the general nature of the 
project, we are faced with a further difficulty which also has its parallel in ‘ordinary’ axiom- 
atic mathematics: though, perhaps surprisingly, few structures (in algebra or proof theo- 
ry: orderings together with suitable ordinal functions) turn out to be basic, recondite con- 
siderations and delicate distinctions may be needed to discover them. 


3. Another familiar axiomatic analysis distinguishing between algebraic and analytic 
methods goes back to Sturm’s famous work. His own purpose in giving an algebraic 
proof of his theorem (valid, as we should say, for all real closed fields) was to avoid con- 
tinuity considerations which, with his background, were associated with dubious infinites- 
imals. In other words, his principal interest was of course not any extra generality; he 
wanted theorems about real numbers, but using only those methods of proof which he 
could see to be valid. 

Digression. Paradoxically, that is, despite its heterodox presentation, the paper of Yese- 
nin-Volpin [32] seems dominated by the preoccupation of current axiomatic mathematics 
here discussed, namely to analyze distinctions between notions of proof in terms of valid- 
ity for different kinds of objects, for instance different ‘number series’. Whatever (alleged) 
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ambiguities or weaknesses the usual concept of natural number may have, surely none of 
the various different ‘number series’ of which Yessenin-Volpin talks, is any clearer! (The 
same applies of course to non-standard models.) Putting it bluntly, the critique of the usual 
concept serves, consciously or unconciously, as a kind of diversionary tactic, drawing 
attention away from the weakness of the new concepts. Even worse (in my opinion), 
quite basic questions raised by the new concepts make perfectly good sense even if the usual 
concept is used. First, there is the question to what extent our notion of set is derived from 
the notion of finite set; and hence (presumably) the distinction between finite and in- 
finite set is derived from the distinction between small and large numbers. Second, there 
is the question of a coherent theory of this distinction, in particular of the logical laws 
obeyed by propositions involving the concept of Jarge and small number. Clearly, the law 
of the excluded middle does not apply! But isn’t it one of the more vulgar errors to suppose 
that therefore intuitionistic logic applies? 

4. Psychologically, my original formulation makes some sense. It separates proofs in- 
volving a, possibly complicated, combination of notions actually named in T, from proofs 
making essential use of novel definitions. Needless to say there is another kind of reasoning 
which has some psychological interest and which may lend itself to rather crude formal 
characterizations: what is immediately clear or evident. (Any theory on the teaching of, 
say, some elementary part of mathematics to children, involves, implicitly or explicitly, 
ideas on the question above.) Though this matter is probably of limited /ogical interest, 
it seems plausible that the apparatus of mathematical logic may be useful for (stating psy- 
chological) theories about it. 


5. Before Gédel’s decisive results the following p/ausibility considerations were valid. 
First, given any structure, say arithmetic, there are, trivially, facts about this structure, which 
can only be stated by means of notions that are not prima facie connected with it (e.g., the 
relation between the structure and quite ‘different’ notions). Second experience had shown 
that certain facts can be proved intelligibly only by use of extraneous notions (e.g., replac- 
ing familiar functions of analytic number theory by means of rational approximations pro- 
duces unintelligible constructions). These considerations made it plausible that other facts 
about the structure could not be proved (convincingly) at all without use of properties of 
such notions. 


6. Ido not know any systematic exposition of the current ideas for analyzing theoretical- 
ly the nature of problems in number theory, but found Swinnerton-Dyer’s remarks [27] 
very congenial. As to the (heuristic) value of the old questions it is hard to say whether 
or not it is supported by experience in number theory. Obviously, the illiterate formulation 
mentioned early on is repulsive to a thoughtful person; but the people concerned are igno- 
rant of the logic which may be needed for a more satisfactory formulation, and therefore 
in no position to judge the matter. 


7. This principle of induction is, classically, equivalent to the socalled: least-element-prin- 
ciple (LEP) 
VX{(Ax@ eX) > ax[xe XA Wy < x) = WEX)]}. 


This formulation is not useful, intuitionistically; more important, the principle of induction 
is immediately evident from the build-up of the integers, while LEP is derived. 


8. Take a primitive recursive An, for which we don’t know whether VxAx holds. Our 
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ordering is according to magnitude for {<x, y>:x SnAy <n} if (Vx < n)Ax; if no 
is the least x for which —, Ax holds, we put a first, and the usual ordering for {<x, y>: 
x Z#nj,y ~ng}. (A first element is given with the building-up process.) 


9. For reference below two points should be noticed. First, since the variables ¥ and f 
occur purely universally in assertions of well-foundedness, formally they may be replaced 
by free variables; hence, if well-defined ranges for the variables x, y and n are understood, 
these assertions correspond to (assertions expressed by) free variable formulae in, say, 
primitive recursive arithmetic when the latter are interpreted without assuming a well- 
defined set of all natural numbers. Second, if no definite ranges are assumed for, say, x 
and y, we have a completely quantifier free ru/e: suppose, for some specific mapping t of 
the ordering into itself, we have derived: (tx < x >txEX) > x EX, i.e., we have 
tx EX > xeEX and (tx < x) > xe X; then weinfer: x ¢ X. Naturally this ‘reduction’ 
is of interest only if we have given a suitable (non-classical) interpretation to free variable 
formulae. 


10. Since the early days of logic it has been clear that a stronger, or more general state- 
ment can often be formulated in a more elementary way! Thus, as Hilbert-Bernays, vol. 2, 
emphasize, implications Vx Ax — Vy By (with primitive recursive A and B) are not fi- 
nitistically meaningful; for given t : A(ty) ~ By, with free variable y, is. Also, in the lan- 
guage of first order predicate logic we cannot characterize say the field of rationals (and 
hence not prove results about if). But we can express the notion of formally real field, 
prove general results about the latter, and ‘thus’ results about the field of rationals. 


11. For an example, see [18], p. 336, also in connection with note 9 above. Roughly 
speaking (or precisely in terms of the next subsection), if an ordering « is built up by the 
processes here considered, so is 2%; but if (under minimal assumptions on finitistically 
decidable sets X in the example mentioned) the induction rule of note 9 has been justified 
for «, it does not necessarily hold for 2%; for other examples, see [4], p. 134. 


12. We do not go here into the relations between (i) our way of expressing the build-up 
of the given ordering and (ii) the principles of proof by induction and definition by recur- 
sion (or the relations between the latter, cf. [15]). Formally, the simplest and, possibly, 
most satisfactory way of expressing the relation is by means of an inference rule; but for 
an analysis of the nature of this inference some new idea should be used. 

13. ‘Extended’ by use of the operations explicitly given (in accordance with the idea of 
regarding ordinals as algebraic structures). Note that for general orderings the mapping 
must be relative to given 2, 4’ since the segment {x: x < &,} may permit an (order pre- 
serving) automorphism. Conversely multiplication by w preserves non-existence of non- 
trivial automorphisms. For detail, see [3] and footnote 11 on p. 339 of [18]. 


14. The corresponding orderings are unique up to isomorphism (built up from the func- 
tions involved in the algebraic structures associated with @,). 

Correction. In several places, e.g. [16], p. 164, 3.331, I have drawn attention to such 
intrinsic characterizations of the ordinals < ¢9, using as one of the functions: « + w* 
without restriction ona. This is pointless, as observed earlier; what is reasonable, for each 
@®, with n = p-+1 say, is: « > w* for x < w,, and say: « > , for other «, and this is in- 
deed sufficient for the required isomorphism result. 


15. By way of illustration, it may be useful to remember that all the functions and re- 
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cursion equations involved can be equally obtained for (suitable) orderings which are not 
well orderings at all, and if la| is not well ordered it makes no good sense to say that 
lexp,, a| has been ‘built up’ by iterating anything |a| times! (The easiest way of getting such 
orderings is to consider non-standard models of arithmetic, with the formal definition, 
unique up to provable isomorphism, of the natural ¢9 orderingand the functions mentioned. 
Of course all this is on/y an illustration because here we take the notion of w-iteration to be 
understood in its proper ‘standard’ sense.) Naturally such properties of ordinal functions 
express the intended fact, namely the possibility of building up the ordering by means of w- 
iteration, only if one understands the latter process. 

Correction. The misuse of verifiable in [2] pp. 284-286, e.g. 3.8, is evidently due to dis- 
regarding the facts just mentioned; the formal proof (even in the elementary system Q) 
that a given function satisfies equations having the form of definitions by recursion tells 
us very little, in general not even uniqueness (though for special types of equations it tells 
us uniqueness with respect to cerfain variations! cf. [15]). 


16. As by-products we shall get formal counterparts of both intuitively obvious facts 
(e.g. that our orderings are well founded in a suitable sense) and somewhat surprising con- 
verses (e.g. that orderings which can be proved to be well founded by the principles con- 
sidered can always be mapped by a function of our hierarchy in an order preserving way 
into one of our w,). Needless to say, detailed ‘technical’ refinements can have great 
philosophical value for establishing the limit eg on a ‘minimum’ of hypotheses. Putting it 
crudely: the more detail we have for our w, the easier it is to show that there is no €9 with 
the same sort of detail! The idea here is this. Simple results on an intensional concept C 
need only few details, given by a crude homomorphic image Re of C, (e.g. the order 
type Ro of a given ordering O). Later R¢- is enriched by other (relevant) structure of C. 


17. The reader will now see the force of the apparently innocent restriction (on p. 489) 
to basic concepts which are not too self reflexive. 


18. In contrast to the preceding section, I shall here give a sketch only. The purpose is 
to draw attention to the kind of technical refinement that is significant (cf. note 16 above). 
The existing relevant work of Feferman and Schiitte is so thorough that one expects to 
read off the additional information provided only one knows what to look for. 


19. Thus we are not concerned with systems in which a// objects are intended to be def- 
initions (or, more precisely, intuitive rules or attributes) in contrast to, e.g., [24], chapter 
VIII and XI or Krasner [12]. In particular, self application does not arise since we consid- 
er sets (or properties) of certain individuals, namely natural numbers. Formally speaking, 
little is gained at the present time by embedding, say, arithmetic in a theory of attributes: in 
contrast to set theory the general principles which we can assert about attributes are so 
weak that special existential axioms seem to be needed even for arithmetic; cf. [14] § 33. 
See also note 27, p. 511. 


20. Specifically, the formulae are meant to define sets (of natural numbers) if they contain 
no free set variables and properties of sets, if they do; cf. note 9, p. 509. For the discussion 
below it is best, I think, to regard (finite) formulae of the ramified hierarchy as codes for 
infinite formulae built up from symbols for the basic concepts considered in this section. 

21. It seems likely that the work of Feferman and Schiitte ‘contains’ all the formal de- 
tails needed (cf. note 18 above); the principal problem is conceptual: to formulate properly 
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just what details are needed. (Compare: sometimes quite elementary calculations in the 
theory of addition of integers generalize to all groups; the main discovery may then be the 
observation that they generalize; the calculations correspond to the formal proof theoretic 
details in the comparison.) 

Digression. A different way of proceeding is to analyse the logical laws which are valid, 
in conjunction with the particular notions here assumed, for incompletely defined proper- 
ties such as P. At the present time this is not promising; cf. note 19. 

Correction. It is sometimes pointed out (e.g., [25]) that the proofs of well-foundedness 
only use intuitionistic logic. This is pointless for the present purpose (at least, without fur- 
ther analysis) since the intended, highly impredicative, interpretation of the intuitionistic 
logical operations (for which Heyting’s rules are valid) has nothing to do with incompletely 
defined properties; cf. end of note 3. 


22. In footnote 8 of [1] Brouwer stresses that proofs are to be conceived as transfinite 
iterations of basic steps. Prof. van Heijenoort has drawn my attention to Zermelo’s pa- 
pers [35] and [36], which also insist on the infinitistic character of proofs, see particularly 
[36], p. 145 top. What is simply thoughtless is the negative attitude towards Gédel’s in- 
completeness theorems which seems to go with this insight into the character of proofs 
(e.g. [35], p. 87, but not repeated in [36]). First of all, properly interpreted, Gédel’s theo- 
rems can be used to support this insight, just as they are used to refute Hilbert’s assumption 
that finite formal derivations reflect faithfully the structure of mathematical reasoning. 
Second, from a practical point of view, formal systems may be a very good description of 
what people are likely to prove. 


23. For a description, see e.g. [15] or, in more detail, [29]. The choice of w@* instead 
of say « is technical; it ensures that P, is properly included in Pz 41. 


24, Just as consistency is the ‘least’ one expects. (Incidentally I do not know if, for Py’, ; 


consisting of P;’ together with the sentence asserting the consistency of P)’, hee a < Eo} 
is the primitive recursive part of |_J{P,: « < €0}.) 


25. Gédel has stressed this point in conversation; indeed the obviously finitist character 
of this addition. 

Correction. In [14], p. 295, I say it would be ‘desirable’ to do so without being in the 
least clear on the point, in fact the formal rules (i) and (ii) which I set up in order to avoid 
the use of such variables are quite unsatisfactory; cf. [18], p. 336. Tait [30], p. 175, last 4 
lines, rejects the use of such variables out of hand. 


26. But omitting the rule for introducing function symbols; they are quite pointless, in 
the presence of an existential quantifier. 


27. Roughly, arithmetic relations correspond to rudimentary ones (and it would per- 
haps be of interest to investigate the correspondence). Provably definite properties of sets 
here correspond to provably recursive ones in (a). One difference between Feferman’s 
unramified hierarchies and (iv) is that he also considers theorems which are not 2}; for 
such theorems we do not get an unambiguous correspondence to the ramified hierarchy 
but only ‘interpretations’; naturally since there are logical theorems 3XA(X), for A ¢2) 3 
such that no set in the ramified hierarchy can be proved to satisfy A (when the quantifiers 
in A range over all sets), cf. [18], p. 349, last line. However, certain striking results do in- 
volve arbitrary formulae; transfinite induction can be proved for all formulae on any or- 


512 G. KREISEL F XXxI 


dinal (in sense (i) of Section 4) [3] though not every formula defines a set; all (provably) 
definite formulae are equivalent to Aj -formulae [4] while only the converse is obvious. 
The significance of these formal results for the particular notions of section 4 remains to 
be analyzed. 

Digression. The provably definite properties were not introduced ad hoc, but correspond 
to a quite central aim, already formulated (informally) by Poincaré: to look for invariant 
(sometimes called: extensionally definite) definitions, that is formulae which define a de- 
finite object so to speak ‘before’ one has made up one’s mind on possible extensions of a 
given universe; and this idea is formulated in terms of extensions of a given model (only 
very general axiomatic conditions being imposed on these extensions). Now though it is 
clear that Krasner’s interests [12] in ‘tests’ go in the same direction, it is possible that his 
notion is actually a bit different from invariance in the sense above. As explained above in 
note 19, Krasner’s notion is not promising if one is principally concerned with sets of in- 
tegers. In any case, at the present time it does not compete with the notion of invariance 
which (i) is quite precise, (ii) has been thoroughly investigated, (iii) has been found to 
have very satisfactory closure properties and (iv) wide generalizations in model theoretic 
generalizations of recursion theory. 

Correction. While, clearly, definitions implicit in the notions of Section 4 must be prova- 
bly definite in a suitable hierarchy satisfying the autonomy condition used, it is not apparent 
that Poincaré’s aim requires such autonomy. So, without further analysis, Feferman’s re- 
sults [4] on provably definite formulae should be regarded as elegant stability properties 
of his unramified systems, and the latter as a kind of metatheory for the ramified hierarchy. 


28. Correction. This presents a change of view; when the term ‘autonomous progres- 
sion’ was introduced, it was thought that the formal machinery of say a was needed in 
an essential way to obtain the ordinal w,,,.In the system [16], p. 172, a symbol O is indeed 
added to the language to express: can (be proved to) be reached by w-iteration. However, 
it is treated as a pseudo property and the relevant result 3.422 uses nothing of the formal 
machinery. 


29. Specifically, a suitable term 7 in a for the natural ordering < of w,,;, in the no- 
tation of section Sa(iii). 


30. ‘Primitive recursive in’ is here taken as a translation of ‘read straight off’. 


31. The reader should recall an analogous (and, I believe, much neglected) situation 
in the theory of cut free infinite proof figures. If we are allowed to infer 3x Ax only from 
At for some term f¢, cut free derivations corresponding to formal first order number 
theory cannot in general be defined primitive recursively; if x Ax is inferred from At 
Vv dx Ax, they can. Put in present terms, proof figures using the latter formal rule conceal 
the steps involved in understanding the formal derivation as a proof. 

Correction. In [16], p. 165, 3.3321, I had in mind the formal fact associated with the two 
rules; my observation was a prize example of pointlessness since I did not realize any 
significance in the formal fact. 


32. In a very strict sense, not even numerical statements are understood in this sense, 
because we can’t even define the sequence P,: « < e9. However, a consistency assumption 
is enough to justify a particular definition of irredundant proof, namely: computation in 
the #, in which the symbols used first occur. 
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Digression. It seems to me that the following mathematical problem may be of some inter- 
est. Let us use the standard definition of the intuitionistic logical operations relativized 
to proofs (and functions) in one of our autonomous progressions of formal theories; in 
this case decidability of the proof predicate is of course unproblematic. What are the result- 
ing formal logical laws? Does cut play a special role? Are there reasonable variants of such 
relativizations? (These questions belong to the theory of hereditarily formalizable proofs.) 


33. In Gédel’s constructible hierarchy, as he repeatedly emphasized, there is no connec- 
tion between the definition principles (first order quantification) and the ordinals used. 
Consequently, with ‘definitions’ in place of ‘proofs’ there is, e.g., an obvious meaning of 
irredundant definition of a set of integers for which all sets of integers in the hierarchy can be 
defined before a certain ordinal, the first constructibly uncountable one. But it seems far 
fetched to regard it as implicit in the definition principle. 


34. The fact that, as pointed out in my other paper, certain very elementary proofs are 
evidently applicable without restriction to grasped domains, correspondsto such elementary 
set theoretic operations as the boolean operations which evidently apply without careful 
consideration of what objects sets are. 

Digression. \t cannot be stressed too much that Heyting’s interpretation of the logical op- 
erations (from which the presentation in my other paper is derived) is abstract and problem- 
atic. This fact may not be quite irrelevant to Brouwer’s notorious objections to Heyting’s 
formalization. Whatever Brouwer’s conscious reasons may have been, he may well have 
had doubts about what exactly is asserted in general logical laws. It seems quite absurd to 
suppose that he couldn’t have made up for himself all the well-known bromides about the 
‘innocence’ of formalization (how it ‘clarifies’ matters, or helps ‘communication’). In 
addition, he would have been right in expecting popular misinterpretation of the sig- 
nificance of Heyting’s particular rules since, at the time, there was no indication at all that 
they were complete for the intended meaning; in this connection, see note I of my other 
paper. 

35. (i) and (ii) are equivalent in classical, but not in intuitionistic /ogic. In axiomatic 
set theory, or, more explicitly, in ramified set theories where not every definable property 
has an extension, (ii) may be stronger than 

Vz{vx[Wy < x)(vy ez) > x Ez] > Vx(« € z)}. 

36. Precisely, while the derivation of (i) for von Neumann’s orderings or the equivalence 
in note 35 above holds for every level of the cumulative hierarchy, the converse is an exis- 
tential theorem on the number of levels to be used. (The replacement axiom is a familiar 
sufficient condition for the converse, that is for the existence of a von Neumann ordering 
isomorphic to any given well-founded ordering.) 

37. In [16], 2.8, p. 147~148 (cf. 2.812) also suprema of longer sequences are considered 
(but given by constructive operations). 

38. In contrast to the set theoretic case 1, 1 know no way, even by use of higher types, 
of satisfying condition (iii) in (a) above. 

39. Evidently, if it can be proved to be a total ordering in the sense VxVy(x < yv 
x= yVy<-x) and to have a decidable field, then < is decidable too. 


40. Using « for choice sequences, Va3n — (an* < an) does not in general ensure the exis- 
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tence of an extensional functional, say N, such that Yo — [«(Na)*] < «(Na), where 

n* = n+1. However, an extensional N can be obtained for decidable < by the device 

in the remark at the end of Technical Note I1I] in my other paper, since for such < 
Vain — (an? < on) > Vad!ln[— (ant < an) A (Ym & n)\(amt < am)], 

where < denotes order of magnitude. For more detail, see the discussion of BI and B1! 

in technical note IV of my other lecture. 

Digression (cf. [37]). For < with a possibly undecidable field F, the situation is less 
neat; (iii’): Wadn — (an* < an), called WF, in [37], is too strong. Specifically, our (ii) 
(transfinite induction) does not imply WF, nor even WF: Wadn — (Wm & n)(am* < an) 
by [37], pp. 333, 335 resp. In any case, WF, > YWxVy [T=& < y) V-—a < x)] (by 
taking «(2p) = x, «(2p+1) = y) and so < cannot be ‘too’ undecidable. 

Correction. While treating WF3 and WF,, which are intuitionistically useless by [37] 
p. 334, [37] omits the further classical equivalent 


Valvne F) > 4n - (an? < an)] 
and the (related) extended bar theorem of [17], p. 249. 


41. Mathematically speaking, there are weak subsystems of PRA which seem to be 
equally interesting; for instance the rudimentary arithmetic of Smullyan or the remarkable 
applications of Kalmar’s elementary functions by Rabin [22]; cf. also the occurrence of 
this class of functions for certain notions of complexity of computations [23]. 


42. When describing the notion of combinatorial proof in [19], App. II B, on p. 197, 
1. 11, I, inadvertently, slipped in a parenthetical restriction (to a finite number of rules) 
which, taken literally (cf., ibid., p. 214, 1.—7), would not allow us to go beyond PRA. But 
for the notion intended in [19], App. II B, the restriction is simply not justified (or else 
the proof of validity of the system /¢ is not correct as it stands!) 


43. I may be unduly skeptical because some (peripheral?) parts of the analysis are not 
thought out at all. For instance, Tait refers to [9] as a source concerning Hilbert’s own no- 
tion of finitist proof, goes on to say ‘itis difficult perhaps to determine what Hilbert really 
had in mind’ and argues that Ackermann’s enumeration of the primitive resursive functions 
is not finitist. But whatever else may be in doubt, Hilbert’s own notion as used in [9] 
certainly includes Ackermann’s function since it is explicitly mentioned! The argument that 
non-trivial mathematics can be done in PRA is weak in view of note41 above. Probably Tait 
has simply overstated a perfectly legitimate point. Also, as is constantly done in foundations 
to prepare the way for highly problematic principles (e.g. full set theory), one is tempted 
to draw a very narrow line (mechanical or formal derivations for the strict formalist) be- 
yond which everything is ‘much of a muchness’. Tait’s particular concern is the use of 
well-founded undecidable species (of a certain sort); cf. note 40. 


44. Compare the quotation from the year 1922 given in the last paragraph of the intro- 
duction above with ([10], p. 180 from 1923): ‘results of usual mathematics... not absolute 
truths, ... only those of proof theory’. It would perhaps be natural to try and reconcile 
these views by supposing that Hilbert believed that the reliability of finitist mathematics 
consists in being anschaulich (visualizable in its entirety). But Bernays has cast coubt on 
this by drawing attention to [10], pp. 162-163. Of course these inconsistencies are not sur- 
prising because (as a matter of experience) in philosophical matters often a major ad- 
vance is needed to produce as much as a semblance of coherence! 
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* The questions and conjectures on pp. 350-351 have been settled; see Prawitz [20] and_ 
technical note II of my other lecture. The paper [68] referred to on p. 362, 1. 12 seems to 
have been withdrawn; however, the general result by Pfister has been published in A. 
PrFisTER, Zur Darstellung definiter Funktionen als Summe von Quadraten, Jnventiones 
Math. 4 (1967) 229-237. 

Friedmans’ construction, referred to in footnote 34 on p. 375, has been published in: 
Bar induction and J7}-CA, J. Symb. Logic 34 (1969) 353-362. 

P. 385: [4] has appeared in 145 (1968) 10-12. 

P. 386: [44] has appeared in 11 (1967) 32-55. 
Correction. P. 371, 1. 16-17 is false. If A is 17} and the X34 sentence (AY € 2®)A is prova- 
ble in ZF, so is AX eL ~M 2@)A by Note VI bG) 1.c. But it is consistent to add to ZF the 
assumption that all constructible sets of integers are analytic [R. B. JENsEN, Notices 
Am. Math. Soc. 15 (1968) 189, 68T-6]. It seems to be open whether there is an 4 in IZ} 
(with a single free variable X) such that A(X*) is a theorem of ZF for some set theoreti- 
cally defined X*, but not for any analytic ¥*. 
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